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PREFACE 


IN writing this book our aim has been to cater for those who 
are meeting abstract algebra for the first time. It is our 
experience that such persons, whether they be university 
students, senior school pupils or their teachers, are prone to 
find the unfamiliar concepts difficult to assimilate on first 
acquaintance. We have also found, however, that once a 
student has gained familiarity with the subject he is impressed 
by its simplicity and elegance and that he is inclined to 
wonder why he once found it difficult. One explanation, we 
think, is that until very recently, the emphasis, in what 
might be called “‘school algebra”’ for want of a better name, 
has been placed on the learning of various tricks, formulae 
and routines designed to solve more or less standardised 
problems, rather than on the understanding of the algebraic 
concepts and processes involved. We do not necessarily 
blame the school teachers for this since the conventional 
nature of the examinations by which their pupils are tested 
has obliged teachers to concentrate on routine rather than 
on understanding. Nevertheless, the students who have thus 
been processed are ill-prepared for the abstractions of 
modern algebra and indeed many of them barely recognise 
the subject as being mathematics at all. Why, they ask, 
should they bother to prove that a field has only one zero? 
Drawing on our collective experiences in lecturing on 
abstract algebra to beginners in the Universities of St 
Andrews, Aberdeen, Leeds, and Notre Dame, Indiana, we 
have endeavoured to facilitate the “getting acquainted” 
with abstract algebra by providing numerous examples and 


counter-examples of the principal concepts introduced, such 
v 
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as semi-group, group, ring, integral domain, field, linear 
algebra, et cetera. The experienced algebraist may find 
much of this book, particularly in the earlier sections, un- 
duly longwinded, but he should remember that the book is 
not written for him. A wealth of suitable treatises serves 
his purpose. Every effort has been made to keep the present 
exposition simple and straightforward. 

After careful consideration we decided not to include a 
rigorous account of the real number system since to do so 
would lead us into a discussion of limiting processes, which, 
strictly speaking, lie outwith the realm of algebra. Further- 
more, in the space at our disposal only the bare bones of 
such an account could have been included and to do this 
would have been contrary to the spirit and purpose of the 
volume. Set theory too has been omitted since it is a disci- 
pline in its own right. We expect the reader however, to 
have some intuitive notion of what the words “set” and 
“real number” imply. 

In addition to the illustrative examples, numerous exer- 
cises are provided for the reader. Most of these he or she 
should find straightforward enough but wherever it seemed 
advantageous, hints have been provided either in the text 
or in the solutions to be found at the end of the book. We 
have used the sign © in place of the somewhat outmoded 
contraction Q.E.D. to indicate the completion of the proof 
of a theorem. 

Our thanks are due to Dr I. T. Adamson and Dr T. S. 
Blyth who have offered many useful suggestions and who 
have assisted in reading the proofs. We are especially 
indebted to the publishers, Oliver and Boyd Ltd, and to 
the printers, Robert Cunningham and Sons Ltd, for the 
unfailing courtesy and patience which they have shown 
during the preparation of the book. 

E.M. P; 


D. ER. 
October 1965 
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CHAPTER I 
BINARY OPERATIONS 


§1. Introduction. Elementary algebra is concerned mainly 
with the handling of symbols which represent real numbers. 
These symbols are used so that rules can be derived for the 
solution of numerical problems of various kinds. Such rules 
enable us to collect together problems of the same kind 
under one heading and to give a general method of solution, 
thereby avoiding the separate treatment of individual prob- 
lems. The methods of elementary algebra are most useful in 
cases in which it is not so much the actual numbers involved 
that are important but the relationships between them. 

In abstract algebra, this basic idea of elementary algebra 
is carried a stage further. The symbols used are no longer 
restricted to represent real numbers, but represent elements 
taken from some set S, which may even be unspecified. In 
many problems of abstract algebra, it does not matter what 
these elements are; it is the operations on them and the 
relationships between them which are studied. 

Most of the ideas which we shall consider in this first 
chapter are based on a study of the simple rules of element- 
ary algebra, which are so familiar that we usually take 
them for granted. For example, when multiplying together 
two real numbers a and b it does not matter whether we 
multiply a by b or b by a; the result is the same whichever 
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we do. Symbolically this is expressed as follows: 
ab = ba, 


where ab is interpreted as a multiplied by b and ba as b 
multiplied by a. Again, if we multiply together three real 
numbers a,b,c we can either form the product ab and 
multiply this by c or we can multiply a by bc. We obtain 
the same result whichever method we use. Symbolically 
this is expressed as follows: 


(ab)c = a(bc). 


Through our illustrative examples, we shall see that these 
properties by no means characterise real numbers; in other 
words, there are quite different algebraic systems in which 
they hold. Results which depend solely on one or both of 
the properties will be true in all such systems irrespective of 
the nature of the elements. Through a study of ideas of this 
kind we are able to generalise some of the notions of ele- 
mentary algebra. Such generalisations have two important 
effects: they economise by bringing together results from 
different systems and they lead to a better understanding of 
methods, because inessential details, which sometimes ob- 
scure the issue, are removed. 


§2. Notations and terminology from set theory. It is 
assumed in this book that the reader is familiar with the 
concept of aset. From time to time we shall consider various 
special sets, in particular those encountered in elementary 
algebra. We shall denote the set of integers (positive, nega- 
tive or zero) by Z, the set of positive integers (by positive we 
mean strictly greater than zero) by P, the set of real numbers 
by R and the set of rational numbers by Q; by a rational 
number we mean a real number of the form a/b, where a is 
an integer and b is a positive integer. 
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If x is an element of a set S, we write xe S. If x does 
not belong to S, we write x ¢.S. For example, 1 € P, 0¢ P, 
£€Q, /2¢Q, /2ER, ze R, /-1ER. 

We use the symbol = to mean “implies that”. For 
example, if x e R, then 


x42x4+1=0 = x=. 


We recall that a set T is a subset of the set S if every 
element x of T is also an element of S: that is, if 


Sees Ss sey 


If T is a subset of S, we write ToS or SDT. 

We recall also that if S, and S, are two sets, then a 
mapping f of S, into S, associates with each element x of 
S; a unique element f(x) of S,. We shall denote a mapping 
f of S, into S, by f: S,; + S, whenever it seems necessary 
to mention S, and S, explicitly. 

If every element y of S, is of the form f(x) for some 
xe S,, we call fa mapping of S, onto S,. 

In general, different elements x,, x, of S, can be associ- 
ated with the same element of S,. If, however, fis a mapping 
of S$, into S, such that 


I(x) =f(%2) > 1 = X2, 


then f is called a one-one mapping or a one-one correspond- 
ence. 

If f:.S,; — S, is a one-one mapping of S, onto S), then a 
reciprocal one-one mapping g:S, 7 S, is defined by 
g(f(x)) = x. To see this, we observe that any element y of 
S, is of the form f(x) for some x € S, and that f(x) is an 
element of S, for every x € S;. It follows that g is certainly 
a mapping of S, onto S;. Now g()1) = %1 if yy = f(%1) 
and g(y2) = x2 if yz = f(x2), so 
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801) = 802) > X= X2 => S(%1) = f(x.) > V1 = y2- 


Consequently g is a one-one mapping. Clearly the map- 
pings g and f are inverses of each other and for this reason 
it is customary to write g = f~ 1 and f = g~’. This notation 
must be interpreted, however, in the purely formal sense 
that if x is an element of S, and y is the corresponding 
element of S,, then 


ff) = gs) = x, 
S(f~'O)) = f(gQ)) = f@) = y. 
We emphasise that f~1(x) does not mean (f(x))7?. 


Example 2.1. Let R, denote the set of real numbers x 
such that x20. Then f(x) = x? determines a mapping of 
R onto R,, which is not a one-one mapping. 


Example 2.2. Let f: R > R be the mapping defined by 
S(x) = x+1. Then fis a one-one mapping of R onto itself. 


§3. Binary operations on a set. In any algebraic system 
we are concerned with three basic concepts—elements, oper- 
ations and relations. For instance, in the formula 2 x3 = 6, 
the integers 2, 3, 6 are elements, in this case all belonging to 
P. The sign x indicates an operation, that of multiplica- 
tion, and the equality sign = denotes a relation which 
relates the number 2 x3 to the number 6. In the case illus- 
trated, the elements, the operation and the relation are all 
familiar, but one of the objects of the present work is to 
consider new types of elements, new operations and new 
relations. Thus a typical algebraic formula might read 
anb~c in which the elements a, b, c belong to some set, 
possibly, though not necessarily, one of the familiar sets 
mentioned in the previous section; the operation A might 
stand for x , or for + , or for + , or for some new opera- 
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tion still to be defined, while the relation ~ might denote 
=, or >, or | or some other type of relation. 

We begin by considering what are known as binary oper- 
ations, in which two elements x, y are combined to form a 
third element x A y in some prescribed way. Other types of 
operation, such as ternary operations in which three ele- 
ments are combined to form a fourth, are of interest, but in 
this book we shall be concerned only with binary operations. 
Of particular interest is the case where x, y and xy all 
belong to the same set S. For such an operation, the intui- 
tive concept is that of combining two elements x, y of S to 
give a third element x A y which is also in S. Two important 
aspects must be stressed. First, the order of the elements 
x, y is important. (We recall, for instance, that x—y is not 
the same as y— x for real numbers.) We therefore regard 
the operation as producing the element xAy out of the 
ordered pair (x, y) of elements of S. Secondly, we do not 
demand that xa y be defined for every ordered pair (x, y). 
(For instance 1 +0 has no meaning in R; for subtraction in 
P,2—3¢P.) 

To make these ideas precise, we use the following defin- 
ition. A binary operation in a set S is a mapping of a 
subset of the set, which we denote by S”, of all ordered pairs 
(x, y) of elements of S, into S itself. For such a mapping, 
we denote the element of S corresponding to (x, y) by xA y3; 
we use the symbol A to denote the binary operation itself. 
Different binary operations in S correspond of course to 
different mappings, and we can say nothing about the pro- 
perties of any particular operation until we state the rules 
by which the appropriate mapping is constructed. 

By a closed binary operation A in S we mean one in 
which the whole of S? is mapped into S: that is, x Ay is 
defined and is in S for all ordered pairs (x, y). It 1s some- 
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times convenient to say that the set S is closed with respect 
to the binary operation A whenever this condition is satis- 
fied. For example, P is closed with respect to x and + but 
not with respect to +. Again, P is not closed with respect 
to subtraction, but Z is closed with respect to subtraction. 

Mathematicians show more inventiveness in creating 
new operations than in finding symbols to represent them 
and it is common practice to use the same symbol, say x, 
for different operations in different sets. The reader is there- 
fore cautioned against assuming that operations in two 
different sets which are represented by the same symbol obey 
the same laws in both sets. For instance, although 
xxy = yxxvand-xax(y xz) = (xoey)eezvare valid when 
x, y, z belong to the set R and x means ordinary multipli- 
cation, it must not be taken for granted that they are valid 
when x, y, z belong to some other set in which an operation 
x has been defined. 


Example 3.1, Addition, multiplication, subtraction and 
division all determine binary operations in R; in the case of 
addition, for example, the number x+y corresponds to the 
ordered pair (x, y). In the case of division, the number x+y 
corresponds to the ordered pair (x, y) provided that y40; 
we do not associate any number with an ordered pair of the 
form (x, 0) in the case of this operation, which is therefore 
not closed in R. 


Example 3.2, Given x, y € R, write 
XA =r ey) 


(where the symbol ,/ denotes the non-negative square root). 
This determines a closed binary operation in R; on the other 
hand 


xAy = J(Qx?—y?) 
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determines a binary operation which is not closed in R, 
because 2x? — y” does not necessarily have a real square root. 
(Many of the operations that we can define in this way are 
of no particular interest, except as illustrations. Algebraists 
are concerned mainly with operations for which there is 
some definite motivation.) 


Example 3.3. Let S be any set and let M be the set of all 
mappings of S into itself. If f, ge M, we define the com- 
position product /. g to be the mapping h: S > S such that 


h(x) = f(g). 


The mapping in which f o g corresponds to (f, g) is a closed 
binary operation in M. 


§4, Equivalence relations. The relations in common use 
in elementary algebra are comparatively few, the most famil- 
iar being equality = and various forms of inequality, >, 
=, <, S. Even in the case of = we must consider carefully 
whether we understand what it means. For instance in the 
equations 1+1 = 2 = 4 we attach slightly different mean- 
ings to the two equality signs. If I visit the grocer to buy 
two eggs I will raise no objection if he gives me first one and 
then another one but I am likely to object if he gives me four 
half eggs. Indeed, while 1+1 and 2 are both elements of P, 
the number ¢ is, strictly speaking, an element of Q. To be 
precise we should only relate, or in this case equate, two 
elements of the same set. The second of the above equations 
would therefore be more accurately expressed by writing 
2 = 4 and its meaning depends upon the definition of 
equality in Q, which is not quite the same thing as the 
definition of equality in P. These points will be considered 
in greater detail at a later stage but they are mentioned here 
by way of a caution. Just as products must be defined in 
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each set in which they are employed, so must the several 
relations such as =, =, > and others. 

As in the case of operations, we shall restrict our atten- 
tion here to the case of binary relations. Intuitively, the idea 
of a binary relation in a set S is based on some process of 
** comparison ”’, involving ordered pairs (x, y), where x, ye S. 
For example, in the set R the comparison might be expressed 
in terms of the magnitudes; for the ordered pair (x, y), 
where x, yeé R, we may enquire whether y is numerically 
greater than x. Again, in the set of triangles in the Euclidean 
plane, we might compare y with x in an ordered pair (x, y) 
by asking whether triangle y is similar to triangle x. From 
the first example, it is clear why we must consider ordered 
pairs; for| y| > | x |iscertainly not the sameas| x | > | y |. 
(We define | x | to be x if x > 0 and to be —x if x < 0.) 
It can be seen that the process involved is one which singles 
out certain ordered pairs and rejects the others. Thus, given 
an ordered pair (x, y), where x, ye S, we regard (x, y) as 
significant if y compares with x in the appropriate way and 
not significant otherwise. 

To make these ideas precise, we use the following defin- 
ition. A binary relation in a set S is a subset of the set S? 
of all ordered pairs (x, y), where x, y are elements of S. Let 
R denote a binary relation. According to our definition, 
R is a subset of S?. We shall also use the symbol R in 
the following way: we shall write xRy to mean the same 
thing as (x, y)eé R. (A modified form of this notation is 
sometimes used, for example as in Example 4.1 below.) 

There are various different types of binary relations, but 
we shall be concerned mainly with those known as equival- 
ence relations. An equivalence relation in a set S is a binary 
relation R in S such that the following conditions are 
satisfied : 
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(1) xRx forallxe S (reflexivity), 
(2) xRy and yRz imply that xRz _ (transitivity), 
(3) xRy implies that yRx (symmetry). 


Example 4.1. If n,,n,¢Z and k is a fixed integer, we 
write m, =n, (mod k) whenever n,—n, is divisible by k. 
The relation = is then an equivalence relation in Z. 

To prove this, we must verify conditions (1), (2) and (3) 
in turn. To verify (1), we observe that n =n (mod k) for 
all ne Z, since n—n = 0, which is trivially divisible by k. 
To verify (2), suppose that n, = n, (mod k) and n, =n, 
(mod k). Then nm, —n, is divisible by k and n,—n, is divis- 
ible by k. Hence n,—n, = Ak and n,—n; = wk, where 
A, ware integers. Therefore n,—n, = (ny —n.)+(n,—73) = 
(A+wk and so n,—n, is divisible by k, which shows that 
ny, =n; (mod k). Finally to verify (3) we start by assuming 
that n, =n, (mod k). Then n, —n, is divisible by k. Hence 
Nz, —N, is also divisible by k and so n, = n, (mod k). 

Example 4.2. Let T be the set of all triangles in Euclid- 
ean space of two dimensions. If t,, t, ¢ 7, write t,~t, if 
t, is similar to ¢t,. Then ~ is an equivalence relation in 7. 

Example 4.3. Let P be the set of all people. If p,, p, € P, 
write p,Bp, if p, and p, have their birthdays on the same 
day of the year. Then B is an equivalence relation in P. 


Equivalence relations are of frequent occurrence in 
modern mathematics and play an important part in many 
theoretical problems. Through the idea of an equivalence 
relation we can formulate the idea of classification of ele- 
ments in a set. Suppose that we have an equivalence 
relation R in a set S and that, for a given element a € S, we 
consider the set A of all elements x € S such that aRx. From 
conditions (1), (2) and (3) for an equivalence relation, it 
follows quickly that given any two elements x, ye A we 

EAA B 
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have xRy and also ae A. We call the set A the equivalence 
class of S containing a. Given any element } € S such that 
b¢ A, it is not true that aRb and in fact we can easily see 
that bRx is false for all xe A. But, in general, there are 
other elements of S which are related to b by R and the set 
of all these elements forms the equivalence class B of S 
containing b. In fact the relation R divides S into mutually 
exclusive classes, each consisting of all elements related to 
any particular element of the class. Further, two elements 
aand b lie in the same equivalence class if and only if aRb. 
In this way we can classify the elements of S into different 
types. In certain problems the individuality of the elements 
under consideration is not important; it is only the type of 
element that matters. In other words, we are satisfied if 
we know to which equivalence class an element belongs, 
even if we cannot identify the element itself. When this 
occurs, we say that we know the element “to within the 
equivalence R”’. 


§5. Stability of an equivalence relation with respect to a 
binary operation. Let S be a set in which there is given a 
closed binary operation A and an equivalence relation R. 
Let S* be the set of all equivalence classes in S with respect 
to R. In general, there are many ways of defining binary 
operations in S*. There is, however, one possible way which 
suggests itself when we bear in mind that the elements of S* 
are classes of elements of S and that in S we already have a 
binary operation A. Suppose that A, B are elements of S§*. 
Let us choose an element x of S such that xe A and an 
element y of S such that ye B. Then xy is defined; it is 
an element of S and so belongs to some equivalence class 
with respect to R. Let us call this equivalence class C. Then 
C depends not only on A and B; it depends also on the 
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choice of x and y. For suppose that we choose two other 
elements x,, y, of S, where x, € A and y, € B. If we form 
x, Ay, then there is no guarantee that x, Ay, is in the 
equivalence class C. In order to obtain a binary operation 
in S* that does not depend on the choices of the individual 
elements of S, we must have xa y and x, Aj, in the same 
equivalence class. For arbitrary A and R in S, the condition 
which we require does not necessarily hold. We therefore 
focus our attention on those cases in which it does hold and 
are thus led to a study of binary relations and operations in 
S for which the following is true: 


if xRx, and yRy,, then (xAy) R(x, Ay;). (5.1) 


Given an equivalence relation R in S and a binary oper- 
ation A in S, we say that R is stable with respect to A if 
(5.1) is satisfied. Then we can uniquely determine a binary 
operation A * in S* by the process described above; we call 
this the binary operation induced by A. This process of 
defining a binary operation in a set of equivalence classes is 
of fundamental importance. It must always be remembered 
that the stability of the equivalence relation R with respect 
to A requires to be verified. 

It is sometimes useful to denote the equivalence class of 
S containing x by x*. Thus 


x* = y* if and only if xRy. 


Then, if R is stable with respect to A, the induced binary 
operation A* in S* is given by 


me pt = (Ay). (5.2) 


The following result is sometimes useful for checking 
whether an equivalence relation is stable. 
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THEOREM 5.1. An equivalence relation R defined in a set 
S is stable with respect to a binary operation « in S if for 
every a, b,c € S, we have 


aRb = (anc) R(bac)and (cAa) R(cAb). (5.3) 


Proof. Suppose that condition (5.3) is satisfied. 
If aRa’ and bRb', then we have (aAb) R(a’' ab) and 
(a’ Ab) R(a' ab’). It follows from the transitive property 
of an equivalence relation that (aA b) R (a’ ab’). © 


Example 5.1. Let n; and n, be integers. Write n, = n, 
(mod 6) whenever 7,, 7, leave the same remainder on divi- 
sion by 6 (see Example 4.1). This determines an equivalence 
relation in the set Z of all integers; for this relation there 
are six equivalence classes, consisting respectively of the 
integers which leave remainders 0, 1, 2, 3,4, 5 on division 
by 6. Thus the set Z* of equivalence classes has only six 
distinct elements, which we denote by 0*, 1*, 2*, 3*, 4*, 5*. 
Ordinary addition + is a binary operation in Z and the 
relation = (mod 6) is stable with respect to this operation. 
The induced binary operation +* in Z* can be specified 
completely by means of an “‘ addition” table. This can be 
written as follows: 


So 
— 
No 
Ww 
> 
WG 


mA hwWN = © 
MWh WN — © 
OM k WN = 
-—-OoOMnN db wh 
NK ON kW 
wn On 
hWN- ON 


For convenience, we have omitted the stars on the entries 
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in this table; in it 0, 1, 2, 3, 4, 5 should not be interpreted as 
integers but as equivalence classes mod 6. The table is used 
as in the following example: the entry in the row labelled 3 
and the column labelled 4 corresponds to 3* +* 4*. Thus 
Seed 4 1%) 


Example 5.2. In the set P of all positive integers, let R 
be the equivalence relation = (mod 6) defined as in the 
previous example. Let A be the binary operation in P 
defined by 


Ny An, = the highest common factor of n, and n3. 


Then R is not stable with respect to A in this case, for we 
have 


DR 20: ah 35; 
but 2A5 is not equivalent to 20A35 since 2A5 = 1 and 
pons = 5. 


§6. Commutative binary operations. Suppose that A is 
a binary operation in a set S such that if x A y is defined then 
so is yAx and 


YAX=XAY. (6.1) 
Then a is called a commutative binary operation. 


Example 6.1. In the set of real numbers, addition and 
multiplication are both commutative. Subtraction and divi- 
sion are not commutative. The operation A defined by 


xay=|x-y| 
is commutative. 


Many of the binary operations which we shall study are 
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not commutative. One such is the important operation 
of matrix multiplication, which is useful in many branches 
of mathematics. For the beginner, one of the confusing 
aspects of this operation is the fact that multiplication is 
not commutative. The trouble is connected not so much 
with the concept of a non-commutative operation (as indic- 
ated above, one of the simple operations of elementary 
mathematics, subtraction, is non-commutative) but with the 
fact that the same notation and terminology are used for 
both matrix multiplication and multiplication of real num- 
bers. This leads to mistakes which are caused by using rules 
automatically without thinking whether they are valid in 
the system under consideration. 

If A is a closed binary operation in a set S and R is an 
equivalence relation in S stable with respect to A, then the 
induced binary operation A* in the set S* of equivalence 
classes is commutative whenever A is commutative. For by 
(5.2) we have, since A is commutative, 


x* AP y* = (XAPM SHAR Hn se 


However if A* is commutative, it is not necessarily true 
that A is commutative. For suppose that R is the binary 
relation in a set S such that (x, y) e R for all x, ye R. Then 
R is an equivalence relation; there is only one equivalence 
class, namely S itself and R is stable with respect to any 
binary operation A. In this case the induced binary opera- 
tion A* is clearly commutative, but A need not be. 

The operation of forming the composition product of 
two mappings (see §3) is, in general, non-commutative. 
Suppose that S is a set containing more than one element 
and let M be the set of mappings of S into itself. Given 
f,g¢M we can define the composition product fog and 
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so we have a closed binary operation . in M. Let x and y 
be distinct elements of S, let fbe an element of M such that 
J(x) = x, f(y) = x and let g be an element of M/ such that 
g(x) = y. Then 


(fog) (x) = f(g(x)) = f(y) = x, 
(g of) (x) = g(f()) = a(x) = y. 


Hence f.g # go fand so « is not commutative in M. 


§7. Associative binary operations. Suppose again that A 
is a binary operation in a set S. By (x A y) Az we mean the 
element uAz, where u = xAy and by xA(yAz) we mean 
xAv, where v = yAz. Suppose that A is such that when- 
ever one of the elements (x A y) Az and x A(y Az) is defined 
then so is the other and they are equal. Then A is called 
an associative binary operation. In particular, a closed 
binary operation in S is associative if and only if 


(LAV) AZ = XA) AZ) C71) 
for all x, y,zeES. 

Example 7.1. In the set of real numbers, addition and 
multiplication are associative operations, but subtraction 
and division are not associative. 

Example 7.2. In the set M of all mappings of a set S 
into itself, the composition product determines an associat- 
ive binary operation. For suppose that f, g,he M. Then 
(fo g)ch is the mapping j.A, where j = fog. Then for 
every xe S, 

(jo A) (x) = JA) = (Fo 8) (A) = fg). 
Also fo (gf) is the mapping fo k, where k = goh. We 
have 


(fo k) (x) = AA) = £8 0 AY) = Se). 
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It follows that (fog) oh = fo(g oh). 

If A is an associative binary operation, then in any 
compound expression formed from elements of S by 
repeated application of A it is unnecessary to use brackets. 
Thus, if w;, x,y, ze S, then 


((WAX)AP)AZ 
can be written 
WAXAYAZ 


without ambiguity; the property expressed in (7.1) ensures 
that all possible combinations of w, x, y, z in this order give 
rise to the same element of S; thus, for example, 


(WAX)AYJAZ = (WAX)A(YA2). 


It should be remembered, however, that the order in which 
the elements are written must not be disturbed, unless the 
operation is also commutative. 

On the other hand, brackets must be retained in the 
case of an operation which is non-associative, or else some 
other convention adopted in which the notation is unambi- 
guous. It is for these reasons that, in dealing with real 
numbers, brackets are sometimes unnecessary but at other 
times are indispensable. Thus, if w,x,y,zeR, we can 
write 

(w+x) + (y+z) 
as 
Wiel Net ea) atane 


without ambiguity, but 
CH= x) =z) 
is not the same as 
Ww-x-y-z. 
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The latter is conventionally interpreted as ((w—x)—y)—z 
which, in general, is different from (w—x) — (y—z); this 
is a consequence of the fact that subtraction is not an asso- 
ciative operation. 


Exercise 7.1. Let S be any set and define a closed binary 
operation A by 
XAY=*x. 


Determine whether A is associative. 
Exercise 7.2. Define closed binary operations A in R by 
(i) xay=|xty|, 
fijeecny = |x| +] y), 
(iii) xAy = x?+y?. 
Determine which of these operations are associative. 


Exercise 7.3. Determine the possible pairs of real num- 
bers a, 6 for which the binary operation defined in R by 


xAy = ax+by 
is associative. 


Exercise 7.4. If A is a closed binary operation in a set 
Sand if Ris a stable equivalence relation in S, prove that the 
induced binary operation A* in the set S* of equivalence 
classes is associative whenever A is associative. 


§8. Identity elements. An element e in a set S is called 
an identity element with respect to a binary operation A in 
S if, whenever x Ae is defined, we have 


xA€@=xX (8.1) 
and whenever eA x is defined we have 
CAX =X. (8.2) 
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Example 8.1. The element 0 is an identity with respect 
to addition in the set R of real numbers; the element | is an 
identity with respect to multiplication in R. 


Example 8.2. With respect to the operation a defined by 
xAy =X, 
the set R has no identity element. 


If S is not closed with respect to A, then it may happen 
that there is more than one identity element in S (as in an 
important case which we shall consider later: see §23). How- 
ever, for closed binary operations, we have the following 
result. 


THEOREM 8.1. Let A be a closed binary operation in a set 
S. If there exists an identity element e in S, then e is unique. 


Proof. Suppose that e’ is any identity element in S. 
Then, by (8.1), we have 


eAe=e' 
since e is an identity element. But, since e’ is an identity 
element, we have by (8.2) 

e'Ae = e, 


Hence e’ = e, so that e is the only identity element in S 
with respect to A. © 


The proof of this theorem breaks down for binary oper- 
ations which are not closed, because in this case e’ Ae need 
not even be defined. 


§9. Inverse elements. Suppose that a, is a closed binary 
operation in a set S and that S has an identity element e. 
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Given an element x € S, we say that an element y € S is an 
inverse for x with respect to A if 

xXAY =e (9.1) 
and 

pax =e. (9.2) 
We notice that if y is an inverse of x, then x is an inverse of y. 


Example 9.1. Every real number x has an inverse with 
respect to addition, namely the number —x. Every real 
number x other than 0 has an inverse with respect to multi- 


plication, namely the number 1/x or x71. 


Exercise 9.1. Let A be the closed binary operation in R 
defined by 


XAY = x+y—2x7y?. 


Show that A is not associative and that 0 is an identity 
element; show also that every non-zero element x of R such 
that x> —1 has two inverses but that if x< —4, then x has 
no inverse. 


The preceding exercise shows that an inverse of an ele- 
ment, if it exists, is not necessarily unique. However, the 
following result shows that, in the case of closed associative 
binary operations, an element cannot possess more than one 
inverse. 


THEOREM 9.1. Let A be a closed associative binary opera- 
tion in a set S and suppose that there is an identity element e 
in S. If x, y, z are elements such that 


Shy =e 
and 

Zine; 
then 
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Proof. Since e is an identity element, 


(ZAX)Ay=eaAy=y 
and 
ZA(XKAY) = ZAe€ = Z. 


But, by the associative property of ~, we have 
(ZAX)AY = ZA(XAP) 
and so y = z. © 
We conclude this section by proving a result which shows 
that, for a closed associative binary operation, the set of 


elements which possess inverses is closed with respect to the 
operation. 


THEOREM 9.2. Let A be a closed associative binary opera- 
tion in a set S. Suppose that there is an identity element e in 
S and that x, y are elements of S possessing inverses x’, y’ 
respectively. Then x Ay has an inverse, namely the element 
pha 


Proof. By the associative law, 
(XAYJAQY AX) = (KAV)APIAN = WALPAY) AX 
and so, since yAy’ = e and xAe = x, we have 
(XA PAY AX’) = XAX =e. 
Similarly (y' Ax’) A(xAy) = e. © 


The rule for forming the inverse of x Ay given by this 
theorem is known as the reversal rule. Notice carefully that 


x’ Ay’ is not necessarily the same as the inverse y’ Ax’ of 
XAY. 


§10. The distributive laws. In several important alge- 
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braic systems, two binary operations are involved. Suppose 
that S is a set with two closed binary operations A and v. 
If 
(XAY)VZ = XVZ)A(QVV2Z) (10.1) 
XV(YAZ) = (XVY)A(XV2Z) (10.2) 


for all x, y, z € S, we say that the operation v is distributive 
with respect to (or over) the operation A. In general, the 
properties expressed in (10.1) and (10.2) are independent of 
one another and in certain algebraic systems one of the 
properties is satisfied but not the other. 

It should be noticed that (10.1) and (10.2) are not sym- 
metrical with respect to the two operations. Thus if v is 
distributive with respect to A, then it is not necessarily true 
that A is distributive with respect to v. 


Example 10.1. Multiplication in R is distributive with 
respect to addition, but addition is not distributive with 
respect to multiplication. 


Exercise 10.1. Prove that when v satisfies the com- 
mutative law (6.1), condition (10.2) holds if and only if 
condition (10.1) holds. 


§11. Additive and multiplicative notations. In the above 
sections, we have consistently used the symbol A to denote 
a general binary operation. Wherever we have referred to 
the particular binary operations of addition and multipli- 
cation in the case of real numbers, we have denoted these 
in the usual way: addition by the symbol + and multipli- 
cation by writing elements next to one another or by using 
the symbol x. 

In the case of the operation + in the set of real numbers, 
we denote the identity element by 0 and the inverse of the 
element x by —x. This notation is a convenient one to use 
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for other binary operations; it is known as the additive 
notation. Thus if we use the additive notation for a given 
binary operation, we denote the operation by +, the identity 
element (if it exists) by 0 and the inverse of x (if it exists) by 
—x. When we use the additive notation, we may refer to 
the element x+y as the sum of x and y. We call the element 
0 the “* zero ’ element and use the term “‘ minus x ’’ for —x. 
In additive notation, the commutative law (6.1) is written 


x+y =ytx 
and the associative law (7.1) is written 
(xt+y)+z = x+(+2); 
equations (8.1) and (8.2) become 
Bee lo comer Oe ae gency 
and equations (9.1) and (9.2) become 
x+(—x) =0, (-—x)+x =0. 


It should be remembered, however, that the commutative 
and associative laws are not necessarily satisfied. Although 
they do hold in many of the cases in which we use the 
additive notation, we reserve the right to use this notation 
whether the properties are satisfied or not. When using the 
additive notation, it is convenient to introduce the idea of 
“subtraction” by defining x—y to be x+(—y). 

In the case of multiplication of real numbers, we denote 
the identity element by 1, and the inverse of the element 
x(x #0) by x~'. Again, we use this notation for other 
binary operations: it is known as the multiplicative notation. 
Thus, in using the multiplicative notation, we denote the 
element corresponding to the ordered pair (x, y) in the 
binary operation by xy, the identity element (if it exists) by 1 
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and the inverse of x (if it exists) by x71; however in certain 
cases it is preferable to use some symbol such as e or J 
rather than the figure 1 to denote the identity element. Using 
this notation, we refer to the element xy as the product of 
x and y. We call the element 1 the unity. Equations 
(6.1) and (7.1) are written 


xy = yx, 
(xy)z = x(yz) 

respectively; (8.1) and (8.2) are written 

ie ix = Xx 
and (9.1) and (9.2) are written 

yee 1, x tx = 1, 

Again, it is not necessarily true that the commutative and 
associative properties hold. 

The multiplicative notation is probably the commonest 
in use for systems which involve one binary operation. In 
dealing with systems involving two binary operations we 
commonly use the multiplicative notation for one of them 
and the additive notation for the other. When this is done, 


the conditions for multiplication to be distributive with 
respect to addition become 


(x+y)z = xz+y2z, 
x(y+z) = xy+-xz. 

Exercise 11.1. Let S be a set and let + be a closed 
binary operation in S. Suppose that there is an identity 
element 0 with respect to + and that each element x € S has 
a unique inverse —x in S. Prove that 

x-y=0>x=y 


if + is either commutative or associative. 


CHAPTER II 
GROUPS 


§12. Introduction. By itself, the idea of a binary opera- 
tion is too weak to lead to a fruitful study of general 
algebraic systems; it is only when additional conditions are 
imposed that we can expect an interesting theory. The more 
important additional conditions—commutativity, associa- 
tivity, the existence of identity elements and inverses and the 
distributive laws—have already been mentioned in Chapter I. 
In the present chapter and in Chapter III we develop the 
early stages of a systematic study of systems which involve 
binary operations satisfying some of these conditions. 

The actual choice of restrictions is influenced by two 
main considerations—whether they lead to interesting 
theories and whether they are representative of systems 
which arise naturally. Regarding the second of these con- 
siderations, it will be observed, for example, that our atten- 
tion is confined mainly to associative binary operations. 
This is because the ideas are influenced by a study of systems 
in which we take “‘sums”’ or ‘“ products ’’ of elements 
according to some natural rule, and it is nearly always the 
case that such sums or products satisfy the associative law. 
(There are exceptions to this, notably in the study of vector 
products and in genetics; multiplication in the genetical 
sense is, in general, non-associative.) 

24 
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The present chapter is concerned with systems which 
involve just one binary operation. By far the most important 
systems of this type are those known as groups. The theory 
of groups has been widely studied over the past fifty years 
and it is still an active field of research. Many of the theo- 
rems of group theory are mathematically attractive and 
elegant, but it is not for this reason alone that groups are 
studied: certain parts of the theory are important from the 
point of view of application to physical theories. 

Before defining groups and discussing some of their 
elementary properties, we define what is meant by a semi- 
group. Here the restrictions imposed are weaker, so that we 
cannot expect so rich a theory. Nevertheless semi-groups 
are of sufficiently common occurrence to justify their study 
as a separate line of development. 


§13. Semi-groups. A semi-group is a system consisting 
of a set S together with a closed associative binary operation 
ain S. 

It is convenient to refer to the set S itself as the semi- 
group. This can always be done whenever there is no danger 
of confusion concerning the binary operation A ; if more 
than one binary operation is possible, then we can say that 
S is a semi-group with respect to A. Although this language 
is less precise than that used in the definition at the beginning 
of this section, in practice it is often more satisfactory and 
seldom leads to misunderstanding. 


Examples. The sets Z, P, R, Q are semi-groups with 
respect to addition and are also semi-groups with respect to 
multiplication. 


A detailed account of the properties of semi-groups is 
EAA C 
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beyond the scope of this book.t The definition of semi- 
group is included so that the reader can become familiar 
with a term which is frequently encountered in modern 
algebra; other instances of a similar nature occur elsewhere 
in the present volume. 

Before passing on to groups, however, the following 
simple theorem on semi-groups should be noted. 


THEOREM 13.1. Jn a semi-group S there is at most one 
identity element. If S is a semi-group having an identity 
element, then each element of S has at most one inverse. 


Proof. The first part of the theorem follows immediately 
from Theorem 8.1, since the binary operation in S is closed. 
The second part follows from Theorem 9.1, since the binary 
operation in S is closed and associative. © 


The phrase “at most one” occurring in each part of 
Theorem 13.1 is essential. A semi-group does not necess- 
arily possess an identity element; for example, the set of 
even integers is a Ssemi-group with respect to multiplication, 
but has no identity element. Again, if S is a semi-group 
having an identity element, then it is not necessarily true 
that each element of S has an inverse. For example, Z is a 
semi-group with respect to multiplication, but no integer 
other than | and —1 has an inverse in Z with respect to 
multiplication. 


§14. Groups. A group is a semi-group S possessing an 
identity element and such that every element in S has an 
inverse in S. Thus a group is a set S together with a binary 
operation A satisfying the following conditions: 

t The interested reader is referred to the book The algebraic theory 
of semi-groups by A. H. Clifford and G. B. Preston, but this book 


should not be tackled until the reader has become familiar with the 
basic ideas and notations of modern algebra. 
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(i) S is closed with respect to A, 
(ii) A is associative, 
(iii) S contains an identity element with respect to A, 
(iv) every element of S has an inverse in S with respect 
to A. 


As indicated in §13, it is often convenient to express this 
more loosely by saying that S is a group with respect to A 
if the above four conditions are satisfied. 

We observe that it follows immediately from Theorem 
13.1 that a group has exactly one identity element and that 
each element of a group has exactly one inverse. 


Example 14.1. The set Z is a group with respect to 
addition, but P is not a group with respect to addition, 
because it has no identity element. The set R is a group 
with respect to addition, but R is not a group with respect 
to multiplication since the element 0 has no inverse. The 
set of non-zero elements of R does form a group with 
respect to multiplication. 

Example 14.2. Let S be a set consisting of four elements 
e, a, b, c and define a binary operation A in S by means of 
the table 


ee a Ds ¢ 
e en a bo Ve 
a @ e€ -c +b 
b b cea 
Cc c b ae 


This table should be interpreted in the same way as the table 
in §5. It can be verified directly that S is a group with 
respect to the binary operation A defined in this way (see the 
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end of §19). The identity element is e, and the inverse of 
each element is the element itself. 


Example 14.3. Let OX, OY be rectangular Cartesian 
axes in two-dimensional Euclidean space and let t, be the 
rotation about O which gives new axes OX’, OY’ such that 
OX’, OY’ are inclined at an angle «(—z<aSz) to OX,OY 
respectively. Then t, is represented by the transformation of 
coordinates 


f 


x’ = xcosa+ysin a, 
, 


y’ = —xsina+y cosa. 


If we first carry out the transformation ¢, and follow this by 
the transformation t,, then we call the combination of these 
two transformations the product of ft, and t, and denote it 
by tt, This determines a closed binary operation in the 
set of rotations about O. In fact fat, is one Of t14.8, tat p+2n 
ta+p—2n» depending on the values of « and f. It is not diffi- 
cult to verify that the set of all such transformations is a 
group, in which the identity is fg and the inverse of f, is t_, 
if a # 7; the inverse of f, is t, itself. 


The above examples should be sufficient to illustrate the 
fact that there are very different kinds of groups. In Ex- 
ample 14.2, we have a finite group}: that is, a group in 
which the underlying set has a finite number of elements. 
Any particular finite group can be illustrated by means of a 
table as in Example 14.2; the table in §5 illustrates a finite 
group with six elements. The number of distinct elements 
in a finite group is called the order of the group. A group 


t The theory of finite groups is the subject of a separate volume in 
the present series, An Introduction to the Theory of Finite Groups by 


W. Ledermann, which the reader should consult for a detailed ele- 
mentary account, 
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such as that in Example 14.3 which is not finite is said to be 
of infinite order. 

One property which all the above examples have in 
common is that the binary operations which determine the 
groups are commutative. A group in which the binary 
operation is commutative is known as an abelian group. 
Our first example of a group which is not abelian occurs in 
§15 below. 

There are several alternative definitions of a group each 
of which is equivalent to the definition given at the begin- 
ning of this section. One such is suggested by the following 
theorem, which we include because of its connection with 
the alternative definition and because the proof gives an 
illustration of the kind of argument with which the reader 
should become familiar. 


THEOREM 14.1. Let S be a set and let a be a closed 
associative binary operation in S. Then S is a group with 
respect to A if and only if the following condition is satisfied: 
(a) for every pair of elements a,beéS, there exist elements 
x, ye S such that 


anx=b, yaa=b. 


Proof. Suppose that S is a group with respect to a. 
Then S has an identity element e and a has an inverse a’. 
Then 


ad(a' ab) = (aaa)ab=eab=b 


and so the equation aA x = b has a solution x = a’ Ab. In 
a similar manner, we can prove that there is a solution of 
the equation yAa = b, namely y = baa’. It follows that 
if S is a group with respect to A, then (a) is satisfied. 
Suppose conversely that (a) is satisfied in S. Then, given 
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aé S, there exists an element of S, which we shall denote 
by e, such that 


a2A€ = 4a. 


At first sight, e appears to depend on a. Suppose, however, 
that b is any element of S. Then by (x) there exists ye S 
such that yAa = b and so we have 


bae = (yAa)Ae = yA(ade) = yAa= Bb. 

Hence for all elements s in S we have 

SAe = 5s. (14.1) 
In a similar manner we can prove that there exists an ele- 
ment fe S such that 

fas=s (14.2) 
for allse S. But if we put s = fin (14.1) we get fae =f 
and if we put s = e in (14.2) we get fae = e. Thus f= e 
and so we have established the existence of an identity 
element in S. By Theorem 8.1 this element is unique. 

We now apply condition («) to the elements a, e. Thus, 

given ae S, there exists an element a’ such that 

ada’ =e 
and an element a” such that 

a“ Aa=e. 
By Theorem 9.1, a” = a’ and so a possesses an inverse. It 
follows that S is a group with respect to A. © 


Because of the result proved in Theorem 14.1, we can 
define a group as follows: a group is a semi-group S such 
that for all a, be S there exist elements x, ye S satisfying 


anx=b, yaa=b. 
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It is an immediate consequence of Theorem 14.1 that 
any element of a group S can be expressed in one and only 
one way in the form aA x, where a is a fixed element of the 
group; in fact the element b is expressible as a A (a’ A b) 
where a’ denotes the inverse of a. Similarly any element of S 
can be expressed in the form y Aa in one and only one way. 

If S is a finite group of order g, we can list the distinct 
elements of S as 


Gis, esi Ge 
where a, is the identity element e. If 
a;Aa; = a;A 
then, by the associative law, 
(a; Aa;) Aa; = (a; Aa) AQ, 


whence 


CAA; = CAs 


so that 
a= 2. 
Thus, given any element a; € S, we see that 
ANG 0; NGs, 005; A0, 


again lists all the elements of S, because there are g such 
elements and all are distinct. 

In the remainder of this chapter, we shall denote a 
general group by G and we shall use the multiplicative nota- 
tion (see §11) for the binary operation in G. The identity of 
G will usually be denoted by e, the inverse of the element x 
by x! and the product of the elements x and y by xy. It 
must be remembered that, unless otherwise stated, xy and 
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yx are not necessarily the same, but, since the associative 
law is satisfied in every group, we have 


(xy)z = x(yz) 
for all x, y,z€G. Also, by Theorem 9.2, we have 
Go 
If x is any element of a group G, we write x? = xx, 
x? = x?x, x* = x°x and so on; thus x" is defined inductively 
by x” = x"~'x. Because the associative law is satisfied, we 


have the usual rules for indices, namely 


Dalida (14.3) 

oy =x, (14.4) 
where m and 7 are positive integers. Also, we have 

Gar)" = 
so that the inverse of x” is (x~')". We can extend the above 
rules to cases in which the integers m and n are either posi- 
tive or negative, making the convention that x° = e; also, 
if n is positive, we define x~" to be (x7~!)". 

We can use this index notation for any system involving 

a closed associative binary operation, with respect to which 
an identity and inverses exist and for which we are using the 
multiplicative notation. On the other hand, if we use the 
additive notation for such an operation, then the corres- 
ponding notation is that of “integral multiples’ and the 
appropriate definitions are 


2x 


ll 
bad 
as 
Ras 


(c= rae 
ce eee, 


er ey Te) 8 se) ie) ) 18) 8 ie 4 er ete matnust Da, ve) fe) se 
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In this notation the index laws (14.3) and (14.4) are re- 
placed by 


(mx) +(nx) = (m+n)x, 
n(mx) = (nm)x. 


Exercise 14.1. Let S be a semi-group having an identity 
element e. If every element x € S is such that there exists 
an element y (depending on x) in S such that xy = e, prove 
that S is a group. 

(Hint: Let y be such that xy = e. By hypothesis, there 
exists z such that yz = e. By using the associative law, show 
that x = z. This proves that x has an inverse.) 


Exercise 14.2. Let S be a semi-group such that (i) there 
exists an element e € S for which xe = x for all x € S, (ii) for 
each xe S there exists y such that xy = e. Prove that Sis a 
group. (Hint: there exists zsuch that yz = e. Then y(xy)z= 
(ye)z = yz and so yxe = e; therefore yx = e. Now prove 
that ex = x for all x, thus showing that e is the identity and 
that y is the inverse of z.) 


§15. Permutations. Suppose that Sis a set having exactly 
n elements, which are labelled 1, 2,3,...,. A permutation 
of these elements is simply a rearrangement of them, or, 
more precisely, a mapping of the set S onto itself. We 
denote the permutation in which the elements are arranged 
iy tie OLder #5, 15,» - 57, OY 


tees a 
ty iy Tye se is 
ie eS 
3 2 4 #1 


For example 
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is the permutation of 1, 2, 3, 4 which changes | into 3, 3 
into 4 and 4 into 1 but which does not affect 2. For con- 
venience we sometimes denote 


(; > Se 2 a b () 
F E : y ; 
im Uy Uy 6 6 o iF l, 


where it is understood that the symbol k takes in turn the 
valueg.1 2.3... ..50. 

As is well known in elementary algebra, there are alto- 
gether n! different permutations of 1, 2,3,...,m. For ex- 
ample, if n = 3 there are six permutations, which are as 
follows: 


123 12283 1-233 L238 [253 P23 

123), \23 1/,\3 T 2h We SS 2 aor 
k k j 

Kern = ; and z, = j be two permutations of 
k k 


1,2,3,...,m. We define the product 2,72, to be the per- 
mutation in which we first rearrange the integers 1,2,...,n 
according to z, and then rearrange the resulting integers 
according to z,. For example, suppose that 


cs Meat ween 
Pde 12 a5) ema et 


The rule is most conveniently worked out by writing z, 
with the entries in the first row following the order appearing 
in the second row of z,, the entries in the second row of 2, 
being inserted under the same integers as before. Thus, in 
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the above example, we can write 
in place of 


Then the rule for forming the product z,2, becomes 


Cee tees <4 2 3 
gain 34.) \ 4 io spe \ 253 1.4 


in the example and in the general case we can write 


an (D-DD 


Defining the product of two permutations in this way, 
we obtain a binary operation in the set of all permutations 
of 1,2,3,...,”. We call this binary operation multipli- 
cation of permutations and use the multiplicative notation. 
We observe first of all that it is a closed binary operation in 
the set of all permutations. It is not commutative; for 
example 


eee | 2S 4, Woe oa 

374. 4 2 A233 eS od 4 
but 

lee So C4 2 a Nee i 2 oe 

aoe 2. 3 Bl Aaa, SN 2 ed oe lay 


However, it is easily verified that the associative law is satis- 
fied. Moreover, there is an identity element, namely the 
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permutation 


1 2a ae east 
L. 2A25 004s orl 


and every element has an inverse: the inverse of the per- 


mutation 
ipix2’ 23 > ee rl 
i tf « de Nees 


is the permutation that restores the integers 71, i, 13,..., 4, 
to the natural order: that is, it replaces i, by 1, i, by 2 and 
so on. For example, the inverse of 


Ry Poe. 5s 
eas eo a ae ee 


ee a eee 
a eee aes |p 


Thus the set of all permutations of 1, 2, 3,...,” forms 
a group with respect to multiplication. This group is called 
the symmetric group of degree 7 and is denoted by S,. It is 
a finite group of order m! and it plays an important part in 
the theory of finite groups. 

Some mathematicians prefer to write 2,7, for the pro- 
duct which we have denoted by z,2,. The notation which 
we have used is the natural one when the permutations are 
regarded as “ operators ” on the integers 1, 2,..., 7. Thus, 
if x, operates first and 2, follows, we write m.x, for the 
combined operation. This conforms with the practice of 
writing f. g for the composition product of the mappings 
J,g. We recall that f. g is defined by 


(fog) (x) = f(g(x)), 
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so that, if fand g are regarded as operators, g is the operator 
which acts first. 


Exercise 15.1. Let S be any set and let G be the set of 
all one-one mappings of S onto itself. Prove that G is a 
group with respect to the composition product for mappings. 
(When S is finite, G is essentially the same as the symmetric 
group whose degree is equal to the number of elements in S.) 


§16. Subgroups. A subset H of a group G is called a 
subgroup of G if # is itself a group with respect to the binary 
operation in G. 

It should be observed that a subset of a group G is not 
necessarily a subgroup of G. For example, the set P of posi- 
tive integers is not a subgroup of Z with respect to addition, 
because P is not a group with respect to addition. 

From the definition of a group, it is clear that a subset 
H of a group Gis a subgroup of Gif and only if the following 
conditions are satisfied: 

G) H is closed with respect to the binary operation of 
G in the sense that if h,,h, ¢ H then h,h, € H, 

(ii) the associative law is satisfied in H, so that 
(h,h2)h3 = hy(h2h3) for all hy, h2, hz € H, 

(iii) H contains an identity element with respect to the 
binary operation of G, 

(iv) every element of H has an inverse in H with respect 
to the binary operation of G. 

In fact, (ii) is automatically satisfied in any subset H, 
because the binary operation of G is associative in G and 
so, in particular, the associative law is satisfied by elements 
of H. 

Condition (iii) is satisfied if and only if the identity e of 
G isin H. For clearly if e ¢ H, then H contains an identity 
element, since he = h = eh for any he H. On the other 
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hand, suppose that H contains an identity element 
Then we have hf = h. Let h~' be the inverse of A in G. 
Then we have h~'(hf) = h~'h = e and so ef = e, whence 
f=e. 

Similarly, condition (iv) is satisfied if and only if for 
each h € H, the inverse h~' of hin G is also in H. Clearly if 
h7} € H, then condition (iv) holds. Conversely if (iv) holds, 
then / has an inverse ’ in H and so Ah’ = e = h’h; but this 
shows that h’ is in fact the unique inverse h~? of h in G. 


Example 16.1. With respect to addition, Z is a sub- 
group of R and of Q and Q is a subgroup of R. 


Example 16.2. Let G be the group of rotations about a 
fixed point in the Euclidean plane (see Example 14.3). The 
rotations fo, ty/25 ta, f-”/2 form a subgroup of G. 


Example 16.3. Let $; be the symmetric group consisting 
of the six permutations of 1, 2, 3 as defined in the previous 
section. The permutations 


]” S23 | eee iy. 2 Ses 
ie eee 5 Die | 5h 4 
form a subgroup C; of order 3 of $3. 


The following theorem gives a useful criterion for deter- 
mining whether a subset H of a group G is a subgroup. 


THEOREM 16.1. Let H be a non-empty subset of a group 


G. Then H is a subgroup of G if and only if for all hy, hh, €¢ H 
we have 


h,hy' eH, 


where hz is the inverse of h, in G. 
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Proof. If H is a subgroup, then, given h, € H we know 
that hy! € H and so, for all h,, A, ¢ H we have h,hy! e€ H. 

Conversely, suppose that H is such that for allh,,h,¢ H 
we have h,hy' eH. Then for all h, ¢ H we have hyhy eH 
and so the identity e of G belongs to H. Suppose then that 
h, € H; since ee H we have, by our hypothesis, ehy ! ¢ H and 
therefore hy! H. Thus we have verified conditions (iii) and 
(iv) above; as already stated, (ii) is satisfied automatically so 
that all we need to do to prove that His a subgroup of Gis to 
verify that (i) is satisfied. Suppose therefore that h,, h, € H. 
Then we know that hy! H and so we have h,, hy! e€ H. 
Therefore, by our hypothesis, we have h,(hy')~' € H; but 
(io n= fi, and so Ah, 6 A. © 


Every group G has at least two subgroups—the set G it- 
self and the set E consisting of the identity element alone. A 
subgroup of G other than G or E is called a proper subgroup. 
We first consider the question of the existence of these in the 
case of infinite groups. 


THEOREM 16.2. Every infinite group has proper subgroups. 


Proof. Let G be an infinite group. Then G has an ele- 
ment x ~ e. We consider the powers x, x”, x° etc. 

Case (i). If there is an integer k such that x* = e, then 
the set of elements 


5 a Se eta a 


forms a subgroup H of G. To prove this, we first observe 
that the inverse of x’ (1Si<k) is x*~', since x'x*~? = x* = 
e = x*-'x!, Hence, given two elements x’, x/ 


(1SASk, 1SjSk) in the set H, we have 
(x) ot es oe ae, 
It follows at once that x*(x/)~! eH if h<j. On the other 
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hand, if h>j, x*t*-J = x Jed, so again x*(x/)"' eH. 
Therefore, by Theorem 16.1, H is a subgroup of G. Since 
G is an infinite group and H is finite, we have H # G. Also 
H has at least two elements and so H # E. Therefore H is 
a proper subgroup of G. 

Case (ii). Suppose that there is no integer k such that 
x* = e. Then the elements 


2 6 


me AG -2 ,-4 \- 
CPX eX, Xo eg Re he eee 


form an infinite subgroup H of G. We observe that x can- 
not be an element of H, for if we had x = x**, where s is an 
integer, then we would have x*5~' = e, which contradicts 
our hypothesis. Therefore, since x cannot belong to H, we 
have H # G, and so H is a proper subgroup of G. © 


For finite groups, however, the result proved above does 
not necessarily hold. A trivial example of a finite group 
with no proper subgroups is that of a group containing only 
one element—the identity—but there are also non-trivial 
finite groups of this kind. For example, the group C; de- 
fined in Example 16.3 has no proper subgroups. In fact the 
possible orders for subgroups of a finite group G are re- 
stricted by the order of G itself according to the result proved 
in the following theorem. 


THEOREM 16.3. (Lagrange’s Theorem.) /f G is a finite 
group of order g and H is a subgroup of G of order h, then h 
is a factor of g. 


Proof. Let H be a proper subgroup of G (if H is not a 
proper subgroup, then the result is trivial). Let 


xy Ca X25 0009 Xp, 


be the distinct elements of H. Since H # G, there is an 
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element y, € G such that y, ¢ H. Consider the elements 
Y2X 1; V2X2,26- > Y2Xh- 
These are distinct, since 
VaXi = Y2Xi > Xi = X; 
and none of them is in H, for otherwise we should have 
Y2Xi = X; 
for some i, j, so that 


y= ax) EH. 


We denote the subset of G consisting of the elements 
V2X1, V2X2,--+> 2X, by y,H and y,H is called a left coset 
of G relative to H. We can now list 2A distinct elements of 
G—the elements of H and those of the left coset y,H. If 
this exhausts G, we have g = 2h. Otherwise there exists an 
element y,¢G such that y,¢ H and y,¢y,H. The left 
coset y;H then consists of A distinct elements none of which 
is in H and none of which is in y,H, for 


Meee = VS dea Vox jj * € y2H. 


Hence we can list 3A distinct elements of G. If this exhausts 
G, we have g = 3h. Otherwise we can repeat the process, 
obtaining another set of h distinct elements none of which 
is one of the 3h elements already listed; then either g = 4h 
or the process can be repeated again. Since G has a finite 
number of elements, the process cannot be repeated inde- 
finitely. Therefore there exists an integer n such that g = nh, 
which proves the desired result. © 


If we write y, = e = xX,, then the theorem shows that 
the elements of G can be classified according to the table 


EAA D 
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Vit1 JiX2 + = ViXh 
Vax, V2%2 «+ « * V2Xh 
Vnt1 Vnt2 + + + YaXh 
In this table, the elements in the ith row form the left coset 
y,H of G. 


If two elements u, ve G belong to the same left coset 
then we have 


U= YX» V= VX, 
for some i, j, k and so 
A Seton ar | 
uv = XPV, Vik, = XTX, EH. 


Hence v € uH: that is, the element v belongs to the left coset 
uH of G. This shows that if z € y,H, then the left coset zH 
of G coincides with y,;H. 


Exercise 16.1. Let G be the group determined by the 
table (see §19) 


% 
Q 
o 
° 
Q 


“~ Qa &Q 
“aa @ A 8 
ao Ko ees 
eo“NM Qa 8 oS 
Sa HAS 
aaean as 
aera aay] Ss 


Prove that the elements e, a, b form a subgroup H, and that 
the elements e, c form a subgroup H,. Determine the left 


cosets of G relative to H, and the left cosets of G relative 
to H,. 
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THEOREM 16.4. If G is a finite group whose order is a 
prime number, then G has no proper subgroups. 


Proof. If G has order g and g is a prime number, then 
the only factors of g are g and 1. Hence, by Theorem 16.3, 
if H is a subgroup of G then H contains either one element 
or g distinct elements. In the former case, H = E since every 
subgroup contains the identity; in the latter H = G. © 


THEOREM 16.5. If x is an element of a finite group G of 
order g, then there is an integer r such that x" is the identity 
element of G and r is a factor of g. 


Proof. Since G is finite, the elements 
bie cas ate 


cannot all be distinct. Therefore there exist integers p,q 
Such that x’ = x=, where p>qg: Then we have x’<? = e. 
Thus there exist positive integers k such that x* = e. Sup- 
pose that r is the least positive integer with this property, 
so that x. = e and x” #e for all integers m such that 
1<m<r-—1. Then the elements 


Rx eee ok (Se) 


are distinct. For if not, there exist integers p,q such that 
l<q<psr and x? = x‘; then x? “4 = e and so there exists 
an integer m satisfying 1Sm<r—1 and x” = e, contrary 
to the hypothesis. 

The elements x, x?,..., x" form a subgroup H of G. 
To verify this, we use the argument used in the proof of 
Theorem 16.1. That r is a factor of g then follows from 
Theorem 16.3 and the fact that H is of order r. © 


The subgroup AH occurring in the proof of Theorem 16.5 
is an example of a finite cyclic group. In general, a finite 
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group G of order g is said to be cyclic if there is an element 
x of G such that the elements of G are 


XX, 00 eee +, ae): 


We also say that an infinite group G is cyclic if there is an 
element x of G such that the elements of G consist of 


Sg NO oo. gk” pe es 
Thus G consists of x, all its positive integral powers (these 
are distinct), the inverse x~! of x, all the positive integral 
powers of x~+ and the identity element e. In either of the 
two cases, finite or infinite, we say that the cyclic group is 
generated by the element x. 

Returning to Theorem 16.5, we now see that in that 
theorem it was proved that every element x of a finite group 
G generates a cyclic subgroup whose order is a factor of the 
order of G. The order of this subgroup is called the order 
of the element x. 

We conclude this section by noting two simple con- 
sequences of Theorem 16.5. 


THEOREM 16.6. Let H be a subset of a finite group G. If 
H is closed with respect to the binary operation of G, then H 
is a subgroup of G. 


Proof. The associative law is satisfied in H, because it 
is satisfied in G. If x is any element of H, then the cyclic 
subgroup generated by x is contained in H, because H is 
closed. Thus H contains all the elements 


Xj Xe ee — 0); 


which we assume to be the distinct elements of the cyclic 
subgroup. Therefore H contains the identity e and also the 
inverse x’~' of x. It follows that HisasubgroupofG. © 
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We note that this argument breaks down if G is not 
finite, for the subgroup generated by the element x may be 
infinite, in which case the identity element e is not of the 
form x" for any positive integer r and the assumption of 
closure alone is not sufficient. For example, with respect to 
addition the set of positive integers is closed in the group of 
all integers, but it does not form a subgroup. 


THEOREM 16.7. A finite group of prime order is a cyclic 
group. 


Proof. Let G be a finite group of prime order and let x 
be an element of G such that x 4 e. By Theorem 16.5, the 
element x generates a cyclic subgroup H of G. But, by 
Theorem 16.4, G has no proper subgroups. Hence H 
(which contains at least two elements) is the whole of G 
and so G is cyclic. © 


§17. Normal subgroups. We now return to a considera- 
tion of the idea of a coset, which was introduced in the 
proof of Lagrange’s theorem. Let G be any group, finite or 
infinite, and let H be a subgroup of G. In this section, it is 
convenient to denote elements of G by g, g1, g, etc., and 
elements of H by h, h,, h, etc. This enables us to identify 
the sources of the elements. 

If g €G, then the set gH consisting of all elements of G 
of the form gh, where he H, is the left coset of G (with 
respect to H) containing the element g. Similarly the right 
coset of G containing the element g is the set Hg of elements 
of G of the form hg, where he H. 

Suppose that g,,g, are two elements of G and that 
g,H, gH are the left cosets containing g, and g, respec- 
tively. If g,H and g,H have an element in common there 
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exist 1,, h, ¢ H such that 


Bh, = B2hp. 
Then 


fl = g2hzh;* € gH, 


so that g, belongs to the left coset containing g, and in fact 
every element of g,H also belongs to g,H. Similarly every 
element of g,H also belongs to g,H and therefore g,H and 
g,H coincide. Thus there are two distinct possibilities and 
no more: either 

(i) g,H and gH have an element in common, in which 
case g,H = g,H and g,, g, belong to the same left coset 
of G, or 

(ii) g,H and g,H have no element in common, in which 
case g,, 8, belong to different left cosets of G. 

If we write g;~g, (mod H) whenever g,, g, belong to 
the same left coset of G, then ~ (mod #) determines an 
equivalence relation in the set G. The corresponding equi- 
valence classes are the left cosets of G. It is natural to 
enquire whether this equivalence relation is stable relative 
to the binary operation in G; if so, as shown in §5, we can 
define an induced binary operation in the set of left cosets 
of G. For the equivalence relation to be stable, we require 
the following condition (a) to be satisfied: 


g,~g, (mod H), g.~g (mod H) => g,g.~g1g85 (mod H). 


We now consider whether this is the case. 

Suppose first that this condition is satisfied. If he H 
and geG, then h~e (mod H) and g~g (mod H). Hence 
hg~eg (mod H). Since eg = g, this means that hg is an 
element of the left coset gH for every h. That is to say, each 
element of the right coset Hg is an element of the left coset 
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gH. Again, since gh~g (mod H) and g~!~g~! (mod H), 
we have ghg-'!~e (mod H), or ghg-! = h, € H. Thus, gh = 
h,g showing that every element of gH is an element of Hg. 
It follows that the right and left cosets of G coincide if the 
condition (c) holds. 

Conversely, suppose that the right and left cosets of G 
with respect to H coincide. If g;~g,; (mod A) and g,~g; 
(mod #H), then we can write g, = gjh, and g, = g3h, for 
some elements h,, h, of H. Hence 


8182 = Bihygzhz. 


But, by hypothesis, Hg, = gzH and so h,g3 = gzh, for 
some h, e H. Therefore 

£182 = 8182h3h2 
so that g122~g182 (mod H). 

It follows that (c) holds in all cases if and only if 
gH = Hg for every g€ G. We conclude, therefore, that the 
equivalence relation ~ (mod #) is stable if and only if each 
left coset of G coincides with the corresponding right coset. 
This result leads us to formulate the following definition. 
A subgroup H of a group G is said to be normal or to be an 
invariant subgroup if every left coset of G with respect to H 
coincides with the corresponding right coset with respect to 
H. When dealing with normal subgroups, we do not need 
to distinguish between left cosets and right cosets and we 
therefore refer to them simply as cosets. The coset of G 
containing x can then be written either as xH or Hx. 

The condition gH = Hg can also be written gHg~1 = H, 
which means that the subset of G consisting of all elements 
of the form ghg~', where he H, coincides with H. If H is 
finite, consisting of the elements h,,...,, then, given any 
geéG, the elements gh,g~', ghag™*,..., gh,g-' form a 
rearrangement of the elements h,,..., Ay. 
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Example 17.1. In an abelian group, every subgroup is 
normal, because a left coset xH is the same as the corres- 
ponding right coset Hx. 


Example 17.2. Let G be the group determined by the 
table in Exercise 16.1. The subgroup H, consisting of the 
elements e, a, b is normal but the subgroup H, consisting 
of the elements e, c is not normal. 


§18. Factor groups. Suppose now that H is a normal 
subgroup of a group G. Then the equivalence relation ~ 
(mod H) defined in §17 is stable with respect to the binary 
operation in G and so, by the results of §5, we can define an 
induced binary operation in the set of cosets of G. We shall 
use the multiplicative notation for this binary operation; 
the product of the cosets g,H and gH is given by the rule 


(gH) (g2H) = (8182). (18.1) 


This is in accordance with the definition of the induced 
binary operation given in §5. 

THEOREM 18.1. Jf H is a normal subgroup of G, the set 
of all cosets of G with respect to H forms a group with respect 
to the induced binary operation determined by the equivalence 
relation ~ (mod HA). 


Proof. The product of the cosets g,H and g,H is given 
by (18.1); since this product is the coset (g,2,)H, the set of 
all cosets is closed with respect to multiplication. 

Let g,H, g.,H, g3H be three cosets of G. Then 


{(g,H)(g.H)} (g,;H) = {(8182)H} (g3H) = {(8182)¢3}H 
(18.2) 

and 

(gH) {(g2H)(g3H)} = (g,H) {(g2g3)H} = (g,(g2¢3)}H. 
(18.3) 
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Since (g;2)g3 = 2,(g2g3), the right-hand sides of both 
(18.2) and (18.3) are the same. Hence the associative law is 
satisfied. 

For any coset gH, we have 


(gH)(eH) = gH = (eH)(gH) 


and thus eH (=#) is an identity element with respect to 
multiplication of cosets. Finally 


(gH)(g~*H) = (gg"')H = eH 
and 
(g-H)(gH) = (g-'g)H = eH, 


and so gH has an inverse, namely g~'H. .o) 


The group occurring in Theorem 18.1 is of prime im- 
portance in group theory. It is called the factor group of 
G by H and it is denoted by G/H. Thus, if H is a normal 
subgroup of G, the factor group of G by di is the set of 
cosets of G together with the binary operation determined 
by (18.1). We notice that in order to define the factor group 
of G by H it is essential that H be a normal subgroup of G. 

If G is a finite group of order m and H is a normal sub- 
group of order n, then the number of distinct cosets of G 
relative to H is equal to m/n, as shown in the proof of 
Theorem 16.3. In this case the factor group G/H is a finite 
group of order m/n. 


§19. Isomorphism. In abstract algebra, we are more 
concerned with operations and relations in a set than with 
the nature of the elements of the set. For example, in 
dealing with the symmetric group $;, the fact that the 
elements of this group are permutations of 1, 2,3 is rela- 
tively unimportant from our point of view; we are more 
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interested in the way in which these permutations are com- 
bined under the binary operation which determines the 
group. In fact, we have all the information about S$, that 
we require if we represent it in the following table: 


eo mae Dc Tae, 


“Qo &R 
“San fa O 
Mo Ne G&A 
a Qaon o 
Sao ao 
eca gas 


_ {1 253)\ S923 ay lee 
(1 33020 3 ee ieee 
Df Lees 12 3 
(Gay ee ae fos cum 


The above table can be constructed by using the definition 
of multiplication of permutations given in §15. However, 
from the point of view of the structure of the group it is not 
necessary to go through this construction. Suppose that 
we have a set consisting of six objects of any kind we like 
and that we label these objects e, a, b, c,d, f. Defining a 
binary operation in the set by means of the above table, we 
obtain a group G. To all intents and purposes the group G 
is the same as S,; the only difference between them lies in 
the nature of their elements. 

In such circumstances we say that we have two groups 
which are isomorphic. Intuitively this means that their 
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structures are essentially the same. Mathematically we 
formulate the idea of isomorphism as follows. 

We recall that a one-one mapping f:S, > S, of S, onto 
S; associates with each element x of S, a unique element 
J(x) of S, in such a way that each element y of S, is ex- 
pressible uniquely in the form f(x) for some x € S,. Let G, 
and G, be two groups and suppose that /:G, > G, is a 
one-one mapping of G, onto G, such that for all x,, x, 6G, 
we have 

f(%1%2) = F(X) FO). (19.1) 

Then fis called an isomorphism of G, onto G, and G, is said 
to be isomorphic to G,. (It should be noticed that we are 
using the multiplicative notation for both G, and G,.) The 
relationship of isomorphism between two groups is a sym- 
metric one, since, if f is an isomorphism of G, onto G, and 
if y, = f(x,) and y, = f(x), then, from (19.1) 


f-*O1y2) = fF DSO 2)) = FF 1x2) = x4x2 
=f ~*(y)f-*02); 
where f_' is the mapping inverse to f as in §2. Thus 
f~':G, > G, isanisomorphism. In other words, G, is iso- 
morphic to G, if G, is isomorphic to G). 

In determining an isomorphism of G, onto G, we first 
match the elements by means of a one-one correspondence 
and then we ensure that the element corresponding to the 
product of two elements of G, is the product of the corres- 
ponding elements of G,. In other words, the correspondence 
preserves the binary operation in G,. 

It is easily seen that isomorphism determines an equi- 
valence relation for groups. In studying the abstract pro- 
perties of groups we are really concerned with the equival- 
ence classes determined by this relation rather than with 
individual groups. These equivalence classes are sometimes 
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referred to as “‘ abstract’ groups. A particular member of 
an equivalence class is called a realisation or representation 
of the corresponding abstract group. 


Example 19.1. The following are three realisations of 
the same abstract group: 

(i) the group consisting of the letters e, a, b,c, d,f 
together with multiplication determined by the table at the 
beginning of this section; 

(ii) the group consisting of the permutations 


| ae [2 | ee 
P22? 37 2. 3 ee eee 
he 2 | ie? eS | i ee) 
1 3 2 \2Z 1 Sy \ Sees 


together with multiplication defined as in §15; 

(iii) the group consisting of mappings ¢,, $,, $4, Per Pas 
és, each of which maps the set of real numbers x satisfying 
x#0, x1 onto itself, and which are given by 
1 
1— x’ 


$0) =, $0) =1-5, hG) = 


bo) =5, bX) = $0) = 1H; 


it 
multiplication is given by the composition operation, so 
that, for example 


1 


($.6,(x) = $(6(x)) = $. (4) = a) = 


=== 400) 
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We now prove a result which says effectively that any 
finite group can be realised as a group of permutations. 


THEOREM 19.1. (Cayley’s Theorem.) Let G be a finite 
group of order m. Then G is isomorphic to a subgroup of the 
symmetric group S,,. 

Proof. Let 


X19 X29 2009 Xm 


be the elements of G. If a is a particular element of G, then 
the products 


AX 4, AXg, + + 5 AXy 


are again the elements of G in some order (see §14). Thus 
AX, = Xj, AX2 = Xj,,-+++,0Xm = X;,, Where i,,i,,...,i, 
are the integers 1,2,...,m in some order. Then these 
integers and their order determine a permutation 


ee ac ee 
ae ee 


of 1,2,...,m. We write f(a) for this permutation. 

Thus with each element a of G we can associate a permu- 
tation f(a) belonging to S,,. If a and b are two distinct ele- 
ments of G, then f(a) # (0), for if ax, = bx, then a = b. 
Thus f determines a one-one correspondence between G and 
a certain subset 7, of S,,. There are m distinct permutations 
ai Te 

Suppose now that a, b are two elements of G and that 


| Pe a ee ¢ 
fa = (; i, wees 


| OR NS 
5) (a Pel: 
FO) Cc J2 es 
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Then axf'=x,, “Gx7=%,,...,.cnn=—"s,,, “atttl eo 
bax, = bx;, = x;,, bax, = bx;, = x;,,... Therefore 


25... ee 
f(ba) = ( ae ‘ey = fe) fla). (19.2) 


It follows from (19.2) that the set 7,, is closed with respect 
to multiplication of permutations. Therefore, by Theorem 
16.6, T,, is a subgroup of the symmetric group S,,. More- 
over (19.2) shows that the one-one correspondence f: GT, 
is an isomorphism of G onto T,,. © 

When a binary operation A in a set S is defined by a 
table, as in Example 14.2, the direct verification that the 
operation is associative can be exceedingly laborious. To 
overcome this difficulty one can compute from the table 
the permutations f(a), f(5), ..., as described above, corres- 
ponding to the elements a, 6,... and then examine in each 
case whether f(a) f(b) = f(ab). If in each case the answer 
is in the affirmative then the set of permutations is a repre- 
sentation of the corresponding abstract set S and the 
operation A in Sis therefore associative since multiplication 
of permutations is necessarily associative. 

Thus, to verify the associative law in Example 14.2 it is 
sufficient to construct from the table the permutations 


c=to-(123%) a=no=(1224) 


p=n0-(J224), cxno=(1234) 


and to verify that the multiplication table for E, A, B, C 
has the same structure as that quoted for e, a, b, c. 


ji? 
Ji, 


§20. Homomorphism. The two important characteristics 
of an isomorphism of a group G, onto a group G, are that 
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it is a one-one correspondence and that it preserves the 
binary operation of G,. If we drop the first of these require- 
ments we still have a relationship of some significance, be- 
cause it connects the structures of the two groups. Such a 
relationship between the groups is called a homomorphism. 
Mathematically, we formulate the definition as follows: a 
homomorphism of a group G, into a group G, is a mapping 
f of G, into G, such that, for all x, ye G,, we have 


S(xy) = FX) £0). 


In this definition, we do not require f to be a mapping 
onto G,. A homomorphism which maps G, onto G, is 
sometimes called an epimorphism. (See §2 for the distinction 
between “into” and “ onto ’’.) Nor do we require that a 
homomorphism be one-one; a homomorphism which is 
one-one is sometimes called a monomorphism. 

Suppose that f:G, - G, is a homomorphism of G, 
into G,. We define the image under f to be the subset of G, 
of elements of the form f(x) for some x in G,; we denote 
the image under f by Im/ or by f(G,). We also define the 
kernel of f to be the subset of elements x of G, such that 
f(x) = e2, where e, is the identity element of G,. We denote 
the kernel of f by Ker f or by f~ ‘(e,). 

Suppose that e, is the identity of G,. Then we have 


F(x) = FC1*) = FEDS) 


and so, multiplying both sides of this equation on the right 
by the inverse in G, of f(x), we get 


e2 = flex). 
We can now prove that Im f and Kerf are subgroups of 
G, and G, respectively. 
(i) Let f(x) and f(y) be elements of Imf. Then their 
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product f(x) f(y) is equal to f(xy) and is therefore an element 
of Im/f. Since Im fis a subset of the group G,, multiplica- 
tion in Im f obeys the associative law. The identity element 
e, of G, is equal to f(e,), as has just been proved, and so 
lies in Imf. Finally, 


e2 = f(e,) = f(xx7") =f) f~"), 

e€2 = fer) = f(x "x) = fx") FQ). 
Consequently, the inverse of f(x) is f(x~*), which is also 
in Im f. Thus Im /f is a subgroup of G,. 

(ii) If x and y belong to Kerf, then f(x) = e, and 
f(y) = e, so that f(xy) = e,” = e,; hence xye Kerf. Mul- 
tiplication in Ker f satisfies the associative law, because this 
is satisfied in G,. The identity element e, of G, is in Kerf 
because f(e,) = e, and if f(x) = e2, we also have f(x~!) = 
e,. Therefore Ker fis a subgroup of G,. 

In fact, Ker fis a normal subgroup of G,. For suppose 
that u is any element of G, and that xe Kerf. Then 


f(uxu™") = fm foflu*) = flue, fu“ *) = fw fu’) 
= f(uu') = fle:) = e2. 


Therefore uxu~' € Ker fand so ux is expressible in the form 
yu where y € Ker f. It follows that the left and right cosets 
of G with respect to Ker f coincide, so that Ker fis a normal 
subgroup of G. 

We conclude this chapter by proving two theorems which 
are concerned with the close connection between the con- 
cepts of homomorphism and factor group. 


THEOREM 20.1. Let G be a group and let H be a normal 
subgroup of G. Then there is a homomorphism of G onto 
G/H. 


Proof. Let x be any element of G and write f(x) for the 
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corresponding coset xH. This determines a mapping f of 
G onto the factor group G/H. Since the binary operation 
in G/H is determined by 


(x) (yH) = (xy)H, 


SO)LY) = f(xy) 
and so f is a homomorphism. © 


We sometimes refer to the homomorphism established 


in Theorem 20.1 as the natural homomorphism of G onto 
G/H. 


we have 


THEOREM 20.2. Let f be a homomorphism of a group G, 
into a group G,. Then G,/Ker f is isomorphic with Im f. 

Proof. Write H = Kerf. We can define a mapping f’ 
of G,/H into Im f by writing 


f(x) = f), 
for if xH = yH, then y~!x eH; therefore f(y~'x) = e,, 
whence f(y 1) f(x) = e, and so f(x) = f(y). It was necessary 
to prove this, so that we can be sure that f’(x#Z) is uniquely 
defined. In fact, f’ is a one-one mapping of G,/H onto 
Im f. That /’ is a mapping onto Im fis immediate from the 
definition, so all we must show is that f’(xH) = f’(yH) im- 
plies that xH = yH. This is readily proved, because if 
f'(xH) = f’(yH) then f(x) = f(y) and therefore f(xy ') =e). 
Thus xy~! € H, which implies that xH = yH. 
We now show that f’ is an isomorphism. We have 


SOA) SO) = F(x) £(Y) = f(xy) 
= f'(xy)H) =f'(aM)(yH)), 
which shows that f’ is a homomorphism. It is in fact an 
isomorphism since we have already shown that the corres- 


pondence is one-one and maps G,/H onto Im f. © 
EAA E 


CHAPTER III 
RINGS, INTEGRAL DOMAINS AND FIELDS 


§21. Introduction. The present chapter is concerned with 
algebraic systems which involve two binary operations. 
Familiar examples of such systems are the integers and the 
real numbers with, in each case, ordinary addition and 
multiplication as the binary operations. As we have seen, 
although the integers form a semi-group they do not form 
a group with respect to multiplication, even if we exclude 
the element 0. On the other hand, the integers form an 
abelian group with respect to addition. Roughly speaking, 
a ring is a system in which the above properties, together 
with the distributive laws, are characteristic: a precise defi- 
nition is given below in §22. Other examples of rings arise 
from quite different sources. In particular, they arise in 
connection with matrices, which we shall discuss in §23. 
These provide examples which illustrate the concept of a 
ring more effectively than does the example of the integers; 
the latter, in fact, typify a special kind of ring known as an 
integral domain, which is defined in §25. 

The real numbers, unlike the integers, have the special 
property that the non-zero elements form an abelian group 
with respect to multiplication, as well as sharing with the 
integers the property that all the elements form an abelian 


group with respect to addition. A field is a system in which 
58 
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these properties, together with the distributive laws, are 
characteristic: a precise definition is given in §24. 

For all the systems discussed in the present chapter, we 
shall call the two binary operations addition and multipli- 
cation and we shall use the additive and multiplicative nota- 
tions respectively. 


§22. Rings. Let S be any set and suppose that there are 
two binary operations, addition and multiplication, in S. If 
Sis an abelian group with respect to addition and is a semi- 
group with respect to multiplication and if also the distri- 
butive laws 


x(y+z) = xy+xz, 

(x+y)z = xz+yz 
are satisfied for all x, y, ze S, then the set S together with 
these two operations is called a ring. Thus a ring is a set S 
together with two operations, addition and multiplication, 
such that the following conditions are satisfied: 


(1) if x, ye S, then x+yeS, 
(2) x+y =yt+x (all x, yeS), 
(3) (x+y)+z2 = x+(y+z) (all x,y, ze S), 
(4) there is an element 0 € S such that x+0 = x = 0+ 
(all x € S), 
(5) for each x € S, there exists an element —x € S such 
that x+(—x) = 0 = (—x)+ x, 
(6) if x, ye S, then xyeS, 
(7) (xy)z = x(yz). (all x, y,z€ S), 
(8) @  x(y+z) = xy +z, 
(ii) (x+y)z = xz+yz (all x, y, ze S). 
As in the case of groups, we sometimes use a looser 
terminology, saying that the set S is a ring with respect to 
the operations. 
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Example 22.1. The set R is a ring with respect to ordin- 
ary addition and multiplication; so are the sets Q and Z, 
but P is not a ring with respect to these operations. 

The most trivial example of a ring is that in which S 
consists of a single element 0, addition and multiplication 
being defined by 0+0 = 0 and 00 =0. Such a ring is 
called a zero ring. 

Of the two operations in a ring R, addition is “‘ well- 
behaved ”’ in the sense that it satisfies the commutative and 
associative laws and there exist an identity element and in- 
verses. On the other hand, multiplication is not so well- 
behaved because it need not be commutative and there may 
be no identity element or inverses. Thus the beginner is not 
likely to encounter difficulty with addition but must exercise 
care when considering multiplication. 

We now prove a simple theorem on rings, showing that 
one of the familiar properties of real numbers is satisfied in 
these more general systems. 


THEOREM 22.1. If x is any element of a ring R, then 
x0 = 0 = Ox. 
Proof. We have 
x* = x(x+0) = x7?4+x0. 
Hence, adding —(x*) to both sides, we obtain 
0 = x0. 
A similar argument shows that 0x = 0. © 


Although this proof is very short, the reader is recom- 
mended to study it carefully. The proof uses several of the 
defining conditions for a ring and it is worth pausing to 
consider just which of them are used. 
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As a corollary to Theorem 22.1, we notice that the 
familiar rule 

x(—y) = —(xy) 
holds in a ring. For, 
xy+x(—y) = x(v+(—y)) = x0 = 0, 
from which the result follows at once. Similarly 
a) = 20) 
and it follows that 
(—x)(-y) = —(—x)y) = —(- (Gy) =, 


which proves another well-known rule of elementary 
algebra. 


Exercise 22.1. Let R be a ring having a unity: that is, 
R has an identity with respect to multiplication. Prove that 
the elements of R which possess inverses with respect to 
multiplication form a group. 


§23. Matrices. Suppose that x,, x2,..., x, are g num- 
bers. Let y,, y2,..., y, be p numbers given by the equations 


Vi = 441%1 + 442%. + o Oo oO FAX qy 
Yo = AqyXyt+Qg2Xot~- - « FrgXy 


ApiX{HayaX%o+ . + + HAggXp 


I 


Vp 
vicre the COcHICIeNtS Ayy, 15, -+ +» Ags4213- ++» Fpq are 
certain numbers. More concisely, we can write the equa- 
tions in the form 


q 
Ji = as Aj jX; (i = i Zs eves p). (23.1) 
j=1 
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We can interpret these equations as determining an opera- 
tion"on x}; Xz, .. 5 xp tow prediiés yyy; Ee 
operation is uniquely determined by @;;, 412, --. 5 pq Such 
an operation is called a homogeneous linear transformation. 

When we regard a set of numbers as operating on 
another set in this way, it is clearly desirable to have suitable 
notations to indicate the point of view. Thus an alternative 
way of expressing (23.1) is to write 


Va Qy1 432 ++ + Aig x4 
Va} 421 422 ++ + Arg X2 

— 5 eee « es} 
Vp DiiGgaae ie « are Be 


it being understood that this means the same as (23.1). 
This leads us to make the following definition. By a 
pxq matrix A we mean a set of pq numbers 4q;; 


(i= 1,2,...,p;j = 1,2,...,q) arranged in a rectangular 
array 
ai Qi2 . . . Qigq 
421 422 + + » A2q 
(23.3) 
CriwiG pate pege Ang 


consisting of p rows and g columns. The numbers 


iy Giz +. + Gig 
form what is called the ith row of the matrix and 
a4; 
a 
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form the jth column of the matrix. We call the number a; J 
the (i, j)th element of A; this is the number in the ith row 
and jth column of A. If p = g, we sometimes refer to A as 
a square matrix of order p. 

The matrix displayed in (23.3) can conveniently be de- 
noted by [a;,] if no confusion can arise concerning p and q. 
We use the single capital letter A if it is not necessary to 
specify the elements. Some authors use rounded brackets 
to denote matrices. 

For present purposes, the reader should regard the num- 
Demepe. . 1, yy, anda), . .. awas: being teal 
numbers. However, as we shall see, the ideas can easily be 
extended to other cases. 

Returning to (23.2), we see that this equation involves 
three matrices; as well as the pxq matrix A = [a,,], we 
have a px1 matrix Y and aq x1 matrix X defined by 


el x 
Y2 X2 


Vp Xq 
Thus (23.1) can be written in the form 
Y = AX, (23.4) 


Y= 


it being understood that juxtaposition of the matrices A, X 
on the right indicates that we are regarding A as operating 
on X to produce Y. 

A matrix (such as Y) which has only one column is 
sometimes called a column vector; the elements of such a 
matrix are called its components. Thus Y is a column vector 
having p components and X is a column vector having q 
components. In a similar way, a 1 xq matrix (that is, a 
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matrix with only one row) is called a row vector, having g 
components. 

The definition of matrix which we have given arises out 
of a consideration of homogeneous linear transformations, 
in which matrices appear as operators on column vectors. 
However, matrices do not need to be tied to transformations: 
they have an independent existence of their own. Thus in 
many cases we shall not think of our matrices as operators, 
but just as sets of numbers arranged in rectangular arrays. 
Nevertheless the concept of matrix multiplication is best 
approached through the idea of a matrix as an operator. 

Suppose that we have two homogeneous linear transform- 
ations, the first transforming q numbers x,,%X2,...,%q 
into p numbers y,, y2,..., ¥, and the second transforming 
ViwV2, --» Vp intownenumbersezgyzs, .. giz. Tet these 
transformations be given by 


q 
yi 2 Gj jXj (i = 1, 2, goog P), (23.5) 
1S 


Pp 
pe oy by (h = 1,2,... mm). (C3i6) 


In matrix notation, we can write these transformations as 
ee (23.7) 
Vi eee 131) (8 (23.8) 


where A, X, Y are defined as before, B is the m x p matrix 
whose (A, k)th element is 5,, and Z is the column vector 


with m components z,,Z2,...,2Z,- Then if we substitute 
from (23.7) into (23.8) we get 
Z = B(AX). (23:9) 


The temptation now is to remove the brackets in this equa- 
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tion, but we are not yet entitled to do this, because we have 
not yet attached any meaning to an expression such as 
BAX. However, in the notation used in (23.5) and (23.6), 
equation (23.9) corresponds to the equations 


P q 
Zp => » Dax (s ans) (23.10) 
k=1 j=1 


obtained by substituting from (23.5) into (23.6). We can 
rewrite equations (23.10) in the form 


q 
Z— > yxy (h =1,2,...,m), 2311) 
j=1 


where c,; is defined by 


P 


Chj rs » DipAyj- (23.12) 


k=1 
This is done by collecting together the coefficients of 


Minxosse,%, For example, the coeflicient of x, is 
By1414+Dp2401+ ... +5yp4,,. Thus we have a transform- 
Demme —X1, %>,.--4%4 10 Z4,25,...;2_ given by 


(23.10), the matrix equivalent of which is 
ZN (23.13) 


where C is the mxg matrix whose (h, j)th element c,,; is 
given by (23.12). 

Wieiranstormation {rom X15 vgs 5 Xo 1024; Sas. 5 Ze 
is effected by first operating by means of A and then oper- 
ating on the resulting numbers by means of B. Thus C can 
be regarded as the result of A followed by B and in view of 
(23.9) it is natural to define C to be the product BA. Thus, 
if B is the m x p matrix whose (h, k)th element is 5,, and A 
is the p xq matrix whose (k, /)th element is a,;, then BA is 
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the m xq matrix whose (h, j)th element is 
P 
YY bag; 
k=1 


The operation of forming the product BA from B and A is 
called matrix multiplication. In the set of all matrices, this 
determines a binary operation. 

The first thing that we should notice about this binary 
operation is that it is not closed in the set of all matrices. 
According to the definition, the product of an m x p matrix 
B and a p xq matrix A is defined and is an m xq matrix C. 
The rule by which we obtain BA from B and A is sometimes 
known as ‘“‘ row into column multiplication ’’ and we say 
that the (i, /)th element of BA is obtained by multiplying 
the ith row of B by the jth column of A. By the product of 
the ith row of B and the jth column of A we mean 


Pp 
YY bia. 
k=1 


Thus multiplying a row into a column simply means adding 
the products of elements in corresponding positions, to do 
which it is essential that there should be exactly as many 
elements in the row as there are in the column. The rule, 
and therefore the definition of multiplication, is not applic- 
able if the number of columns of B is not the same as the 
number of rows of A. 

If B, A are such that the product BA can be defined 
according to the above definition, then we say that B and 
A are conformable for multiplication. We notice that even 
if BA is defined, then AB is not necessarily defined, so that 
in talking about conformability for multiplication it is neces- 
sary to rcfer to the matrices in the correct order. 
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Suppose now that B is an m xp matrix and that A is a 
pxXm matrix. Then both AB and BA are defined. We can 
therefore ask the question: are they the same? In view of 
the fact that we have arrived at the definition of multipli- 
cation by considering successive operations, there is no 
reason why we should expect an affirmative answer; it is 
common experience that the order in which operations are 
carried out is significant. In fact BA is an m xm matrix but 
AB is a p Xp matrix, so that unless p = m we cannot have 
AB = BA. If p = m, then it is easily seen that AB = BA 
if and only if the equations 


Pp Pp 
_ andy; = » Binh; 
k=1 k=1 


hold for all i, 7. Since in general these equations are not 
satisfied, the matrices AB and BA are in general different, 
even when A and B are both p xp matrices. 

It is worth noticing that this concept of product of 
matrices and that of the product of permutations are closely 
connected with the more general concept of the composition 
product of two mappings, introduced in Example 3.3. In 
the cases of matrices and permutations we formulate the 
definition of multiplication of two objects by regarding these 
objects as operating on the elements of some set. Essentially 
we are regarding the objects as certain types of mappings 
and we define multiplication in the same way as we define 
the composition product of two mappings. One of the 
features of the operation of taking the composition product 
is that it is non-commutative, a fact which is reflected in the 
non-commutativity of permutations and of matrices with 
respect to multiplication. 

Although matrix multiplication is non-commutative, it 
is certainly associative. To prove this, we must show that 
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if A, B, C are matrices such that one of the products (AB)C 
and A(BC) is defined then so is the other and they are equal; 
this is the requirement for associativity according to the 
definition given in §7. Suppose, for example, that (AB)C is 
defined. Then the number of columns of A is equal to the 
number of rows of B, since AB is defined; and the number 
of columns of AB is equal to the number of rows of C. But, 
by the definition of multiplication, the number of columns 
of B is the same as the number of columns of AB whenever 
the latter is defined. Thus A is a pxq matrix, Bisagxr 
matrix and C is anr xs matrix, for some p, q, r,s. Then BC 
is defined and is a g xs matrix; also A(BC) is defined and is 
a pxs matrix. This establishes the existence of the product 
A(BC); it remains to be shown that A(BC) = (AB)C. The 
(i, j)th element of BC is 


r 
eS Dixy; 
k=1 


and the (i, j)th element of A(BC) is therefore 


q [2 
> ai, (z, buss) 


h=1 
Rearranging the order in which the summations are per- 
formed (the beginner is recommended to write out the details 
in full for some particular values of p, g, r, s) we see that the 
(i, j)th element of A(BC) is 
r q 
» (3 ana) Ckj 
k=1 \h=1 
and this is in fact the (i, /)th element of (AB)C. Thus the 
matrices (AB)C and A(BC) have their (i, j)th elements the 
same and so (AB)C = A(BC). 
In many problems concerning matrices we employ an 
operation of addition as well as multiplication. The appro- 
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priate definition is simply one which “ takes over” the 
operation of addition applied to the elements of the matrices. 
In this respect it is essentially different from multiplication, 
which is based on the idea of combining operations. As 
might be expected, its close connection with the operation 
of addition of numbers results in its being “‘ well-behaved ” 
in the sense that many of the familiar rules are obeyed. 
However, we must be careful in one respect, for the defin- 
ition applies only to two matrices of the same size and shape. 
Suppose that A = [a;;] and B = [b,,;] are two pxq 
matrices. Then we define the sum A+B to be the pxq 
matrix whose (i, /)th element is a;;+5,;. For example 


ee eee! —4 eal 1 
= lumrd—2 Some 6 Oni 


The operation of forming the sum of two matrices in this 
way is called matrix addition. We say that two matrices A 
and B are conformable for addition whenever A+B exists 
according to this definition. If A is a p xq matrix and B is 
a p’ xq’ matrix, then A and B are conformable for addition 
if and only if p’ = pandq’ = q. 

Thus we have another binary operation in the set of all 
matrices. Like matrix multiplication, it is not a closed 
operation in this set. However, it is easily seen that the 
operation is both commutative and associative; thus if one 
of A+B and B+ is defined, then so is the other and 


A+B = B+A4; 
if one of (A+B)+C and 4+(B+C) is defined, then so is 
the other and 
(A+ B)+C = A+(B+C). 


Moreover, matrix multiplication is distributive with respect 
to addition in the sense that 
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(A+B)C = AC+BC 
and 
A(B+C) = AB+AC 


whenever the appropriate sums and products are defined. 
(See Exercise 23.1.) 

We now consider the existence of identity elements for 
addition of matrices. In view of the fact that the set of all 
matrices is not closed with respect to addition, we must be 
careful to recall the precise requirement for an identity 
element. 

Suppose that A is any pxq matrix. If O:,,,) denotes 
the p x q matrix whose elements are all zero, then A+ O¢, 4) 
is defined and 


A+ Ovp,q) = A. 
Similarly 
OcatA = A, 


Thus O,,,4) is an identity element for the binary operation 
of matrix addition. It is called the zero p xq matrix or just 
the zero matrix if its order is clear; it must be understood, 
however, that there are infinitely many different zero mat- 
rices, because Oc, g) = Oy, » if and only ifp = randg = s. 
We can easily see that any identity element for matrix addi- 
tion is necessarily a zero matrix. 

In a similar way, we can show that there are identity 
elements for matrix multiplication. Let A be any pxq 
matrix and let [(,, be the g xq matrix whose (i, /)th element 
is unity if i = j and is zero if i 4 7: for example, 


120736 
Aiea) | 
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Then we find that 

Al =A 
and similarly that 

Iq B = B, 


where B is any matrix having g rows. Thus J(,) is such that 
Al, = Aand I,,,B = B whenever the appropriate products 
are defined and so, according to the definition in §8, it is an 
identity element for matrix multiplication. We call J(,) the 
unit g x g matrix or just the unit matrix if its order is clear. 

We have seen that the operations of addition and multi- 
plication are not closed in the set of all matrices. In the 
case of both operations, there is an infinite number of iden- 
tity elements, whereas in the case of a closed operation there 
cannot be more than one such element (see Theorem 8.1). 
If, however, we concentrate on suitably restricted sets of 
matrices, then we obtain systems in which one or both of 
the operations are closed. Such systems are a fruitful source 
for examples of groups and rings. 

Suppose that M, , is the set of all p xq matrices, where 
p and q are certain fixed integers. Then the sum of any two 
members of M, , is defined and is a member of M, ,; 
hence M,, is closed with respect to addition. In fact, it 
is not hard to prove that M,, is an abelian group with 
respect to addition. The identity element of this group is 
the zero pxq matrix O;,,,) and the additive inverse of the 
matrix A = [a,,] is the matrix —A = [—a,,]: that is, the 
mattix whose (i, j)th element is —a;;. 

With respect to multiplication, M,,, is not closed unless 
p=4q. In fact, if p #q, the product of two elements of 
M,,q is not defined. However, the set M,, of all pxp 
matrices, where p is some fixed positive integer, is closed 
with respect to multiplication. Furthermore, M,, is a 


72 ELEMENTARY ABSTRACT ALGEBRA § 23 


semi-group with respect to this operation and it has an 
identity element, namely the unit matrix of order p. It does 
not form a group, for it is easily seen that 


Ow, pA = Op, p) 


for all Ae M, ,. Moreover, except in the case p = 1, there 
exist non-zero elements of M,,, which do not possess in- 
verses. For example, let A be the square matrix of order 
Pp (>1) such that every element of A is zero except for ay). 
Then for any other matrix Be M,,, the product AB is a 
matrix whose elements are all zero except possibly in the 
first row; thus there is no matrix B such that AB = [,,). 

Consider now the set M, , in relation to the two oper- 
ations of addition and multiplication. With respect to 
addition, M, , is an abelian group and with respect to 
multiplication it is a semi-group; also the distributive laws 
are satisfied. Therefore M,,, is a ring with respect to addi- 
tion and multiplication. 

Addition in the ring M,, , can be regarded as having been 
constructed out of the operation of addition of real num- 
bers, but it is important to remember that it is a new 
operation; the fact that the same symbol is used for both 
operations must not be allowed to result in confusion be- 
tween them. Multiplication in M, , can be regarded as 
having been constructed out of both addition and multipli- 
cation of real numbers; again it is important to emphasise 
the fact that it is a new operation. Similar methods of con- 
structing new operations in a set S’ out of given operations 
in a set S are encountered in dealing with other algebraic 
systems. In most cases of this type, the properties of the 
new operations in S’ such as commutativity and associativity 
are consequences of similar properties of the original oper- 
ation in S. For example, commutativity of addition in 
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M,,, p is a consequence of commutativity of addition of real 
numbers; associativity of multiplication in M,,, is a con- 
sequence of associativity of multiplication, commutativity 
of addition and distributivity of multiplication over addition 
for real numbers. 

These observations lead to the following generalisation 
of the rings M, ,. Let R be a ring and let A be a set of p’” 
elements a;, (i,j = 1, 2,...,p) in R, arranged in the array 


441 412 443-4. Q1p 
424 422 423..-- a2» 


Qn Qp2 aps Soc Qpp 
Let M, ,(R) be the set of all such arrays. In M, ,(R) we 
define two operations, addition and multiplication, as fol- 
lows. If A, Be M,, ,(R) we define the sum A+B to be the 
array C such that 


Cij = ajyjt+5;;, 
and we define the product AB to be the array D such that 


Pp 
dj; = ME Gi, Dy j- 


k=1 
In relation to these operations, the set M, ,(R) is a ring, 
which we call the ring of p x p matrices over R; the elements 
of M,,,(R) are called p x p matrices (or square matrices of 
order p) over R. 


Exercise 23.1. Prove that matrix multiplication is dis- 
tributive with respect to addition. 


Exercise 23.2. Prove that every identity element for 
multiplication in the set of all matrices is a unit matrix [(,) 
for some integer p. 

EAA F 
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Exercise 23.3. Prove that if R has an identity element 
with respect to multiplication, then M, ,(R) also has an 
identity element with respect to multiplication. 


§24. Fields. In the case of a ring we have seen that 
addition is ‘‘ well-behaved’ but that multiplication need 
not be: in the ring M,,, of square matrices of order n 
multiplication is not particularly well-behaved because it is 
not commutative and inverses do not necessarily exist. We 
now consider algebraic systems having two binary opera- 
tions, addition and multiplication, which both behave in 
a similar way to the corresponding operations on real 
numbers. Such systems will clearly resemble the real num- 
ber system closely in certain respects: however, as we shall 
see, they may possess properties which are very different 
from those of real numbers. 

Let S be any set having at least two elements and sup- 
pose that there are two closed binary operations, addition 
and multiplication, in S. If S is an abelian group with 
respect to addition, if the elements of S apart from 0 form 
an abelian group with respect to multiplication and if also 
the distributive laws 


x(y+z) = xy+xz, 
(x+y)z = xz+yz 


are satisfied for all x, y, z eS, then the set S together with 
these two operations is called a field.f According to this 
definition multiplication of non-zero elements of a field S is 
both commutative and associative. However, as in the 
proof of Theorem 22.1, we can show that x0 = 0. 


+ A separate volume in the present series Introduction to Field 


Te by I. T. Adamson, provides a more detailed account of this 
subject. 
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and similarly that 0x = 0. From those results it is a trivial 
matter to verify that multiplication in S is both commu- 
tative and associative even if one or more of the factors is 
the zero element. Thus a field is a non-zero ring in which 
multiplication is commutative, in which there is an identity 
with respect to multiplication and in which every non-zero 
element has an inverse with respect to multiplication. Alter- 
natively, we can list all the properties in full: a field is a set S 
together with two operations, addition and multiplication, 
such that all the conditions (1) to (8) in §22 are satisfied 
and also 


(9) xy = yx (all x, yeS), 
(10) there is an element 1 € S such that x1 = x = Ix 
for all xe S, 
(11) for each x € S such that x # 0 there exists an ele- 
ment x~' eS such that xx~! = 1 = x71x. 


The element 0 in a field is usually called the zero of the field 
and the element 1 is called the unity. These terms are also 
used, where appropriate, for rings. 


Example 24.1. The sets R, Q are fields with respect to 
ordinary addition and multiplication. The set Z is not a 
field with respect to these operations, because an integer 
other than 1 or —1 does not have an inverse in Z with 
respect to multiplication. The set M,_, of square matrices 
of order n (n>1) is not a field with respect to matrix addi- 
tion and multiplication: Again, there are elements of M,,, 
which do not have inverses; moreover, multiplication is not 
commutative. 

In certain cases we encounter systems which satisfy all 
the required properties for a field with one exception, the 
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commutative law of multiplication. Such systems are known 
as division rings or skew fields or more briefly as sfields. 
Indeed, some authors use the term “ field ” to include the 
non-commutative case. 


§25. Integral domains. Further examples of fields will 
be given in §§27, 28 and 29. In the present section, we pause 
to consider the position of the set Z of integers. As we have 
already seen, Z is not a field with respect to ordinary addi- 
tion and multiplication, but it is a ring with respect to these 
operations. However Z satisfies some of the additional 
requirements for a field and in many ways multiplication is 
a “‘ better behaved’ operation than matrix multiplication. 
For example, it is commutative, there is an identity element 
and if p,q are any two non-zero integers, the product pq is 
also non-zero. In the case of matrices, only the second of 
these three conditions is satisfied. We have already seen 
that matrix multiplication is non-commutative and to show 
that the third condition is not satisfied, we consider the 
2x2 matrices 


o-[2 o} o=[2 8] 


Both A and B are non-zero (we recall that a zero matrix has 
all its elements zero) but it is easily shown that the product 
AB is zero. 

A ring in which multiplication is commutative is called 
a commutative ring. A non-zero commutative ring in which 
there exists a unity and which is such that if x # 0 and 
y # Othen xy ¥ 0 is called an integral domain. The set Z 
is an integral domain with respect to ordinary addition and 
multiplication and any field F is an integral domain; for if 
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x, ye Fand xy = 0, y ¥ 0, then y~! exists and we have 


x = x(yy™*) = (xy)y™* = Oy = 0. 


Thus xy = 0, y # 0 imply that x = 0, showing that the 
product of two non-zero elements of F cannot be zero. 

In general, if x is an element of a commutative ring such 
that x # 0 and xy = 0 for some non-zero element y, then 
x is said to be a divisor of zero in the ring. 


Exercise 25.1. Prove that, in an integral domain, the 
cancellation law 
Hoc, a = 0 = b= Cc 
holds. 


Exercise 25.2. If Ris acommutative ring having a unity, 
prove that an element x € R is not a divisor of zero if it has 
an inverse with respect to multiplication. 


Exercise 25.3. Let R be a commutative ring having a 
unity. In the set R’ of ordered pairs (x, y) where x, ye R, 
define addition and multiplication by 


(X14, Vs) + (Xo, V2) = 1 +%2, Vi +2) 
(X15 V1)(%25 V2) = (% 1X2, Wi1V2) 


Prove that with respect to these operations R’ is a commu- 
tative ring with a unity. Show also that R’ is not an integral 
domain even if R is a field. 


§26. Isomorphism and homomorphism of rings. In §19 
we introduced the idea of homomorphism of groups. In the 
case of rings a similar idea is important. Intuitively, an iso- 
morphism of a ring R, onto a ring R, is a one-one corres- 
pondence which preserves both the operations of addition 
and multiplication. Mathematically, an isomorphism of a 
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ring R, onto a ring R, is a one-one mapping f of R, onto 
R, such that, for all x,, x, € R,, we have 


Sf(X1 +2) = f(%1)+f%2), 
S(%1X2) = f(%1)f (2). 


Using arguments similar to that employed in §19, we can 
show that if f: R, — R, is an isomorphism so is the inverse 
mapping f~1:R, > R,. Thus if R, is isomorphic to R, 
then R, is isomorphic to R,. 

In a similar manner, we define isomorphisms between 
integral domains and between fields. As in the case of 
groups, isomorphism between rings determines an equi- 
valence relation and the corresponding equivalence classes 
can be regarded as abstract rings of which particular 
members of the classes are realisations or representations. 


Example 26.1. Let R be any ring and let M(R) be the 
set of all » xn matrices over R of the form 


x O50) 6 a 
0 se @ son @ 


O50 Ohne seee 
where xe R. Let f: M(R) > R be the mapping defined by 
J(X) = x. Suppose that 


Mi 


0 0 
0 y O 
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If f(X) = f(¥), then x = y and so Y = Y. Hence fis a 
one-one mapping. Given x ¢€ R, there exists X € M(R) such 
that {(X) = x; hence fmaps M(R) onto R. For all X¥, YeR 
we have 


CY) ty = JX) y) 


POY) xy = f(X) (1). 


Hence f is an isomorphism. 

The idea of a homomorphism plays an important part in 
the theory of rings. A mapping f of aring R, into aring R, 
is called a homomorphism if 


(x+y) =f) +f0) 
F(xY) = FC)SO) 


for all x, y¢.R,. Thus, as in the case of groups, a homo- 
morphism is not required to be a one-one mapping of one 
system onto another, but relates the structures to the extent 
that the image of a sum or product is the sum or product of 
the corresponding images. 


and 


§27. The field of rational numbers. In §2, we defined a 
rational number to be a real number of the form a/b, where 
a is an integer and 5 is a positive integer. This definition 
has the disadvantage that the same number is represented 
in different ways; for example 3/6 = 1/2. It also requires 
us to have some idea as to what is meant by a real number, 
which is a complicated concept. We now explain a more 
satisfactory way of defining the field of rational numbers 
starting with the integral domain Z. The method is appli- 
cable to any integral domain and so enables us to construct 
other examples of fields. 

We begin by considering ordered pairs of integers (a, 5), 
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where b is non-zero. We define addition and multiplication 
of such ordered pairs according to the rules 


(a, b)+(a’, b’) = (ab' + ba’, bb’) 
(a, b) (a’, b’) = (aa’, bb’). 


What we have in mind here is that the ordered pair (a, b) 
corresponds to the number a/b. This determines the way in 
which we write down the above definitions. 

Thus we have defined closed binary operations in the 
set S of all ordered pairs (a, b) of integers, where b # 0. We 
now say that the ordered pair (a,, b,) is equivalent in S to 
the ordered pair (a2, b,) if 


a,b, = azb,. 


It is easily seen that this determines an equivalence relation 
in S. For clearly the relation is reflexive and symmetric; to 
prove that it is transitive, suppose that (a,, b,) is equivalent 
to (a2, b,) and that (a2, b,) is equivalent to (a3, b,). Then 


a,b, = a,b, and a,b, = a3b2. 
Therefore 
a,b,a,b; = azb,a3b2, 


so that, by the commutative property of multiplication of 
integers we have 


a2b,(a,b3—a3b,) = 0. 


Hence, if a, 4 0, we have a,b; = a3b, since b, # 0 by 
definition. If a, = 0, then the original equations show that 
a, = 0 = a3, so that again we have a,b, = a3b,. 

The equivalence relation just defined is stable with res- 
pect to both the operations of addition and multiplication 
of elements of S. For suppose that (a,, ,), (aj, 64) are 
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equivalent to (a2, b,) and (a, bi) respectively. Then 
a,b, = ayb,, a,b) = ayb}. 


Therefore 


(415i +b144)b2by = (a,b) + b244)b 1b} 


and 
a,ab,b, = a,a,b,b}, 


so that (a,, b,)+(a}, 64) is equivalent to (a2, b.)+(a}, b4) 
and (a;, b,) (a), b}) is equivalent to (a2, b,) (a3, 55). Hence 
if Q is the set of equivalence classes, we can define the 
induced operations corresponding to addition and multipli- 
cation in S. We call the elements of Q rational numbers 
and we can verify that Q is a field with respect to addition 
and multiplication defined in this way. The identity with 
respect to addition is the class (0, 1)* consisting of elements 
of S of the form (0, 5); the identity with respect to multipli- 
cation is the class (1, 1)* consisting of the elements (a, a). 
The inverse of (a, b)* with respect to addition is (—a, b)* 
and, if a # 0, the inverse of (a, b)* with respect to multipli- 
cation is (b, a)*. The commutative, associative and distri- 
butive laws are easily verified directly from the definitions 
of addition and multiplication in S. 

In the above definition of the rational field, we assumed 
that the elements a, b were integers. If we examine the 
argument carefully, we see that it is still valid if a, b are 
elements of any integral domain. Thus, starting with any 
integral domain D, we can construct a field by carrying out the 
above process. This field is called the field of quotients of D. 

According to the above definition, a rational number is 
a certain equivalence class of ordered pairs (a, b), where a 
and b are integers and b # 0. Instead of denoting the 
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equivalence class containing (a, b) by (a, b)*, we usually 
denote it by . It should be remarked that ; does not stand 


for the number pair (a, b) but for the equivalence class to 
which (a, b) belongs. Thus when we write 2 = 4 we are really 
expressing the fact that both pairs (2, 3) and (4, 6) belong 
to the same equivalence class, which we can denote either 
by 2 or by 4. 

If an ordered pair of integers (a, b) is such that b is a 
factor of a, then, from the definition of equivalence given 
above, every ordered pair (a,, b,) equivalent to (a, b) is such 
that b, is a factor of a,. In fact, if a = nb, then (a, b) is 
equivalent to the ordered pair (n, 1). In this case the corres- 


ponding rational number can be denoted by - Let Z’ be 


the set of all rational numbers of the form = For addition 


in Z', we have 
k, k, k,+k, 
ci gp EN 


aa (27.1) 
ky k, _ kk, 
7 (27.2) 


Because of these properties, a rational number of the form 
k we samiit ee Seca: 
i can be “identified ’’ with the corresponding integer k. 


Conditions (27.1) and (27.2) show that the mapping 
f: Z— Z' defined by 


f(n) = 5 


is an isomorphism, so that Z’ and Z are essentially the same 
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apart from the nature of their elements. Thus the rational 
field Q contains a subset which is isomorphic to Z. More 
generally, the field of quotients formed from an integral 
domain D contains a subset which is isomorphic to D. 


§28. The field of complex numbers. Logically, we ought 
at this stage to formulate a precise definition of real numbers 
before we employ them to construct complex numbers. To 
do this satisfactorily would unfortunately require more 
space than is available in a book of this kind and would 
take us beyond the realms of elementary abstract algebra 
into analysis. We assume therefore that the reader has an 
intuitive concept of what a real number is. It might be 
regarded, for instance, as a number which represents the 
distance between two points on an infinite straight line. 
There is no harm in adopting this point of view as a tem- 
porary measure provided it is realised that we have not 
formulated an algebraic definition of a real number and 
that we have not derived for real numbers their familiar 
properties such as those of commutativity and associativity 
of addition and multiplication. 

Intuitively the idea of a complex number is based on the 
notion of extending the system of real numbers in order to 
attach a meaning to square roots of negative numbers. This 
can be done by introducing a ‘‘ number ”’ i satisfying the 
property i? = —1; this number is presumed to obey all the 
usual laws of elementary algebra concerning addition and 
multiplication. We are then led to a study of numbers of 
the form a+ib, where a and b are real numbers; using the 
standard formula the roots of any quadratic equation with 
real coefficients can be expressed in terms of numbers of 
this form. Such numbers are called complex numbers. Each 
complex number is the “‘ sum ”’ of two parts; to add together 
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two numbers we simply add together the corresponding 
parts, so that 


(a+ib)+(a'+ib’) = (at+a’)+i(b+b’'). 


In multiplying together two such numbers we use the pro- 
perty i? = —1, so that we get 


(a+ ib) (a’' + ib’) = (aa' — bb’) + i(ab’ +a’b). 


To a certain extent, the above method of defining com- 
plex numbers is unsatisfactory, because it can be argued 
that we are not entitled to assert that there is a number i 
such that i? = —1. To meet this objection, we can proceed 
as follows in defining complex numbers. 

Let C be the set of ordered pairs (a, b) of real numbers. 
Define addition in C by 


(a, b)+(a’, b’) = (at+a’, b+’) (28.1) 
and define multiplication anew by 
(a, b) (a’, b’) = (aa’ — bb’, ab’ + ba’). (28.2) 


In formulating these definitions, we are influenced by the 
rules given above for the addition and multiplication of 
“numbers”? of the form a+ib. It should be emphasised 
that although we are again dealing with ordered pairs we 
are making quite different definitions of addition and multi- 
plication from those made in §27. It is easily seen that the 
set C forms a field with respect to these operations. For 
example, multiplication is distributive with respect to 
addition, since 


{(a, b)-+(a’, b’)} (a’, b") = (ata', b+b') (a", b’) 
= ({at+a'}a"—{b+b'}b", {ata'}b" + {b+b'}a") 
a= (aa" — bb", ab" + ba")+(a’a"—b'b’, a’b" + b’a") 
= (a, b) (a’, b”)+(a’, b’) (a", b”). 
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The zero element of this field is the pair (0, 0) and the 
unity is (1, 0). The inverse of (a, 6) with respect to addition 
is (—a, —b) and if (a, b) is not the zero element, that is to 
say if a, b are not both zero, then (a, b) has an inverse with 
respect to multiplication, namely, 


ma —b 

a+b? a?+b7/ 
An element of the field C is called a complex number. The 
real number a is called the real part of the complex number 


(a, b) and the real number 3 is called the imaginary part. 
Given a complex number (a, b), we can verify that 


(a, b) = (a, 0)+(0, 1) (, 0). (28.3) 


Thus every complex number can be expressed in terms of 
complex numbers of the form (x, 0) and the complex num- 
ber (0, 1). Addition and multiplication of numbers of the 
form (x, 0) is given by 


(x1, 0)+(X2, 0) = (41 +2, 0) _ (28.4) 
(x1, 0) (x2, 0) = (X1X2, 0). (28.5) 


Because of these properties we can “ identify’? a complex 
number of the form (x, 0) with the corresponding real num- 
ber x. Let R’ be the set of all complex numbers of the form 
(x, 0). By (28.4) and (28.5), the mapping f: R > R’ defined 
by 

f(x) = (x, 0) 


is an isomorphism. Then if we write a for the complex 
number (a, 0), b for (b,0) and i for the complex number 
(0, 1), by (28.3) we have 


(a, b) = a+ib. 
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Moreover, it is easily verified that (0, 1)? = (—1,0) = 
—(1, 0), so that we have i? = —1. Thus we have recovered 
the notation used in our intuitive discussion on the idea of 
a complex number. For most practical purposes, it is more 
convenient to write a+ib rather than (a, b). 

The complex numbers a+ib and a—ib in which a, b are 
real are said to be conjugate to one another. If z = a+ib is 
a complex number its conjugate is frequently denoted by 2. 
The bar over any complex number z denotes that the sign 
of the imaginary part is to be changed. Observe, how- 
ever, that if w and v are two arbitrary complex numbers, the 
complex numbers u+iv and u—iv are not as a rule conju- 
gate. For, suppose u = a+ib, v = c+id where a, b, c,d 
are all real. Then 


ut+iv = (a—d)+i(b+c), 
u—iv = (a+d)+i(b—c), 
whereas 


u+iv = (a—d)—i(b+c). 


If z = a+ib, where a, b are real then zZ = (a+ib)(a—ib) 
= a’+b*. Thus zz is always a positive real number whose 
positive square root +.,/(a* +b’) is called the absolute value 
or modulus of z. The modulus of z is denoted by | z |. 

As in §27, we can easily see that other algebraic systems 
can be defined in a similar way by replacing the real numbers 
a, b in the ordered pairs by elements from some different 
field; however, the system obtained is not necessarily a 
field, for it may happen that the inverse of a non-zero 
element with respect to multiplication does not exist. In 
the case we considered above, we used implicitly the pro- 
perty of real numbers that if a?+5? = 0 then both a and b 
are zero. There are fields in which this property does not 
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hold; in particular it does not hold in the complex field 
itself. 


Exercise 28.1. Let M be the set of all 2x2 matrices of 
the form 


where a and b are real numbers. Prove that M is a field with 
respect to matrix addition and multiplication. Prove also 
that the correspondence in which we associate the matrix 


[ia] 


with the complex number a+ib is an isomorphism of the 
complex field C onto the field M. 


§29. The ring of residue classes mod n. Suppose that n 
is some fixed integer and that m,,m,¢Z. We say that m, 
is congruent to m, modn and write m, = m, (modn) if 
m,—m, is divisible by n. As we saw in §4, this determines 
an equivalence relation in the set Z. The corresponding 
equivalence classes are called residue classes mod n. Each 
consists of all integers which leave the same positive re- 
mainder on division by n; thus the residue class containing 
1 consists of 1, l-+-n, 1+2n, 1+3n etc. There are altogether 
n distinct residue classes, corresponding to the n different 
non-negative remainders on division by n. We denote the 
set of residue classes by Z,. If r* denotes the residue class 
containing r, then the distinct elements of Z, are 


OF 1* en, Gi 1)". 


The equivalence relation = (mod n) is stable with res- 
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pect to both addition and multiplication in Z. This is easily 
verified (we have already mentioned the fact that = (mod 6) 
is stable with respect to addition: see §5). Therefore we can 
define addition and multiplication as the induced operations 
in Z,. Since Z, is finite, each of these operations is com- 
pletely specified by a table. For example, in the case n = 6 
the addition table is as given in §5, and the multiplication 
table is 


ONT 2 3 “4s 
0 0.050" Oret. 20a 
1 0. l “2 Bass 
2 0 2 4 0 2054 
3 0 3: 0 33770. 33 
4 O 4 2 0 4,22 
= OS “4 se 2 el 


Again we have, for convenience, omitted the stars in writing 
down this table. 

We now consider the properties of addition and multi- 
plication in the set Z,. We observe first of all that Z, is an 
abelian group with respect to addition. That addition is a 
closed, commutative and associative operation is easily veri- 
fied; the identity with respect to addition is the zero element 
0* and the additive inverse of r* is (—r)*, which is the same 
as (n—r)*. Multiplication, however, is less well-behaved. It 
is certainly a closed, commutative and associative operation 
in Z, and there is an identity element, namely the residue 
class 1*. But the multiplicative inverse of a non-zero element 
of Z, does not necessarily exist: for example, as the above 
table shows, none of the elements 2*, 3*, and 4* has an 
inverse in the casen = 6. In fact, these elements are divisors 
of zero; for example 2*3* = 0*. 
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Thus in general, Z, is a commutative ring with a unity, 
but it is not necessarily an integral domain. 

Suppose now that n is a prime number p. If m,, m, are 
integers such that the product m,m, is divisible by p, then 
one of m, and m, is divisible by p. In other words if 
mm, = 0 (mod p), then either m, = 0 (mod p) or m, = 0 
(mod p). In this case, therefore, 

mim; =0* = either mt = 0* or m3 = 0*, 
so that Z, is an integral domain. In fact, we can say more; 
for if mf # O*, then m, and p have highest common factor 
1 and so there exist integers h, k such that m,h+pk = 1. 
(The reader who is not familiar with the result used here 


should accept it for the time being. It follows from a general 
result (Theorem 37.1) proved in Chapter IV.) Therefore 


m,h = 1 (mod p), 
so that 
mth* =1* (= h*mj). 


Hence any element mf of Z, other than 0* has an inverse 
with respect to multiplication; in other words, Z, is a field. 

If m is not a prime number, then Z, is not a field, nor 
even an integral domain. For if n is not prime, there exist 
integers m,, mz such that 


l<m,<n, 1<m,<n 
and 
mm, = Nn. 
Then we have 


ain, — On 
although m* # 0*, m} # 0*. So in this case Z, has divisors 


of zero. 
EAA G 
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Exercise 29.1. Using the fact that the non-zero elements 
of Z, (where p is a prime) form a group with respect to 
multiplication and using also Lagrange’s Theorem (see §16), 
prove that for any non-zero element aeéZ,, we have 
a’~! = 1, Hence prove Fermat’s Theorem, namely that if 
nis any integer and p is a prime then n?—n is divisible by p. 


§30. Subrings. A subset S of a ring R is called a subring 
of R if S is itself a ring with respect to the operations of 
addition and multiplication in R. 

From the definition of a ring, it follows that a subset S 
of a ring R is a subring of R if and only if the following 
conditions are satisfied : 

(i) if x, ye S, then x+yeS, 

(it) addition is commutative and associative in S, 

(iii) S contains an element u such that 
x+tu = x = u+x for all xin S, 

(iv) for every x € S, there exists an element x’ e S such 
that 
XX = = x +x, 

(v) if x, ye S, then xyeS, 

(vi) multiplication is associative in S and is distributive 

with respect to addition. 


We first observe that some of these conditions hold be- 
cause S is a subset of a ring R; in fact (ii) and (vi) are satis- 
fied automatically. (A similar situation occurs in the case 
of subgroups of a group; see §16.) Again, we can show that 
(iii) is satisfied if and only if 0 € S, the element wu then being 
equal to 0; also (iv) is satisfied if and only if —x eS, the 
element x’ then being equal to —x. Therefore a subset § 
is a subring of a ring R if and only if S is closed with respect 
to addition and multiplication and contains 0 and the addi- 
tive inverse of every element of S. 
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Example 30.1. Z is a subring of R. 


Example 30.2. A square matrix [a;,] of order 7 is called 
an upper triangular matrix if a;; = 0 whenever i > j. The set 
of all upper triangular matrices of order n, over a ring R, 
is a subring of the ring M,, ,(R) of matrices over R. 


Exercise 30.1. Let R be a ring and let S be a subset 
of R. If S is a subgroup of R with respect to addition 
and is closed with respect to multiplication, prove that 
S is a subring of R. 


Every ring R has two trivial subrings: R itself and the 
set consisting of the zero element alone. We now examine 
some subrings which in general are non-trivial. 

Suppose that R is a ring with a unity e. It is natural to 
write 

2e =et+e, 3e = 2et+e,..., 


as in §14. Similarly we write 
(—2)e = (—e)+(—e), (—3)e = (—2)e+(—e),.... 


Thus we define ne for any ne Z. Let N denote the subset 
of all elements of R of the form ne, where ne Z. It is easily 
verified that N is a subring of R (to do this, we can use the 
result of the above exercise). 

In certain cases, the subring N consists of a finite number 
of elements. For example, in the field Z, of residue classes 
mod 2 (see §29), there are just two elements 0 and e (in §29, 
these were denoted by 0* and 1* respectively). We have 


fe— 0; Je = e€, 46 —,..., 


and so in this case N coincides with Z, itself and consists of 
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two elements. More generally, in the ring Z, we have 
ne = 0, (nt De =e, (n+2)e = 2e,..., 


and N coincides with Z,,. 

On the other hand, if R is a ring in which ne ¥ 0 for all 
non-zero integers n, then the subring N of R is infinite, for 
if pe = ge and p # q then (p—q)e = 0, which contradicts 
the assumption ne # 0; hence all integral multiples of e are 
different. 

For any ring R, it is easily seen that there is a homo- 
morphism of the ring Z of integers onto the subring N of R; 
in fact the mapping f: Z > N defined by 


f(n) = ne (néZ) (30.1) 


is a homomorphism. In the following theorem, we examine 
the situation more closely. 


THEOREM 30.1. Let R be a non-zero ring with a unity. 
Then there exists a subring N of R which is isomorphic to 
either the ring Z of integers or one of the rings Z, (n=2) 
of residue classes mod n. 


Proof. Suppose that in the subring N defined above we 
have ne # 0 for all non-zero integers n. Then the mapping 
f: ZN defined by (30.1) is an isomorphism, for if 
S (ny) = f(m2), then nye = nye, so that (n, —n,)e = 0, which 
implies that n,—n, = 0. Therefore in this case N is iso- 
morphic to Z. 

Suppose now that there is a non-zero integer n (> 2) such 
that ne = 0. We shall assume that 7 is a positive integer 
and that ke 4 0 for k = 1,2,...,n—1; clearly no gener- 
ality is lost when this assumption is made. Then the elements 


0) 6, 2E,)<:4-5 sGiaee 
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are distinct, for otherwise there would exist a positive integer 
k such that ke = 0 and I1Sk<n. If m is any integer and 
gq, r are the quotient and remainder respectively on division 
of m by n, we have 


me = (qn+r)e = q(ne)+re = re 
since ne = 0. Hence N contains exactly n distinct elements, 
namely 
0, e, 2e,...,(n—I)e. 
We can therefore define a one-one correspondence 
g:Z, — N between Z, and N by 
ics) — Ke Ue — Olen — 1) 


and it is easily shown that this mapping is an iso- 
morphism. © 


Exercise 30.2. Prove that the subset of the complex field 
consisting of all complex numbers of the form m+ in, where 
m and n are integers, is a subring of C. 


Exercise 30.3. If D is an integral domain and S is a sub- 
ring of D containing the unity of D, prove that S is an 
integral domain. 


Exercise 30.4. Prove that the ring M,_,(R) of matrices 
over a ring R contains a subring isomorphic with R. 


§31. The characteristic of an integral domain. Let R be 
a ring, not necessarily possessing a unity. If ae R, we can 
define ‘‘ multiplication ” of a by integers as follows (see § 14): 


Oa = 0) la=a, 2a = a+¢4, 34= 24a+4,..., 
(—la = —a, (—2)a = (—a)+(—Q),.... 


This notation is similar to that used in §30, but it is now 
applied to any element of R. 
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Multiplication of the element a by integers as defined 
above must not be confused with the operation of multipli- 
cation in R. It is a different type of operation, because it 
operates on pairs in which the two elements are, in general, 
in different sets. We shall meet further examples of this 
type of operation when we deal with polynomials and with 
vector spaces. 

It is easily verified that multiplication of elements of R 
by integers satisfies the following: 


(m+n)a = ma+na, 
m(na) = (mn)a, 
m(a+b) = ma+mb, 
m(ab) = (ma)b = a(mb), 


where a, be Rand m,ne Z. 
Suppose now that D is an integral domain, with unity e. 
Then we have 


na = n(ea) = (ne)a. Git} 


Therefore na = 0 if ne = 0. Equation (31.1) also shows 
that ifna = Oanda ¥ 0, then ne = 0, because in an integral 
domain bc = Oimplies that either b = Oorc = 0. Ifna = 0 
and x is an arbitrary element of R, it follows that 


nx —ane)x = Ox =), 


Thus in an integral domain either ne = 0 and nx = 0 for 
all x € D or else nx ¥ O for all non-zero x € D. 

Suppose first that D is an integral domain such that 
ne # 0 for all positive integers n. Then we have nx ¥ 0 for 
all non-zero integers n and all non-zero x € D. In this case 
we say that D has characteristic zero. For example, the real 
numbers R form a field of characteristic zero, and the in- 
tegers form an integral domain of characteristic zero. 
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Suppose now that there exists an integer n (40) such 
that ne = 0; then, as we have seen, we have nx = 0 for all 
xe WD. There is no loss of generality in assuming that n is 
positive, for (—n)x = 0 if and only if nx = 0. Let p be the 
least positive integer n such that nx = 0. Then we have 


pe=0=px (allxeD) 
and 
ne#0, nx #0 


for all non-zero x € D and all n such that 0<n<p. In this 
case we say that the integral domain has characteristic p. 
We now show that the characteristic p of an integral 
domain is a prime number. 
Suppose that p has a factor k where 1<k<p; then 
Pp = kq for some integer g such that 1<q<p. Then gx #4 0 
if x 4 0 and also 


0 = px = (kq)x = k(qx). 


Hence if y = gx we have y #0 and ky = 0; but k is a 
positive integer less than p and so we have a contradiction. 
The required result now follows. 

From the results of §30, we now see that an integral 
domain of characteristic zero has a subring isomorphic with 
the ring of integers and that an integral domain of charac- 
teristic p (where p is necessarily a prime, as we have shown) 
has a subring isomorphic with the field Z, of residue classes 
mod p. 

A field which has only a finite number of elements is 
known as a finite field. The field Z,, as well as having 
characteristic p, contains exactly p elements, and so is an 
example of a finite field. Another example is the set S con- 
sisting of four elements 0, 1, a, b, together with addition 
defined by the table 
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OiteelucnGierd 
0 O° 1 aarp 
1 1 O0fe" 42 
a a” sb" 0 Fal 
b b a ! © 

and multiplication defined by the table 

OO lL waa £6 
0 0 0 OF 8 
1 O° tanec 
a 0 a b 1 
b 0 6 1 @ 


This is an example of a field with four elements and with 
characteristic 2. In general, the order of a finite field (that 
is, the number of elements in the field) is a power of the 
characteristic, as the following theorem shows. 


THEOREM 31.1. Let F be a finite field. Then F has charac- 
teristic p, where p is a prime number and the order of F is a 
power of p. 

Proof. The characteristic of F is not zero, because F is 
finite and so cannot contain a subring isomorphic to the 


ring Z. Hence the characteristic of F is a prime number p. 
The elements 


le, 2e, 3e,...,(p—l1)e, pe(=0) 


of Fare distinct. If this exhausts F (as it does in the case of 
Z,) then the order of Fis p. If not, there exists an element 
a not in this set and then F contains the p? elements 


me-+na 
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where 1 SmSpand1<ngp. Ifm,, m, are particular values 
of m and n,, n, are particular values of n such that 


mye+nya = mze+n24a, 
then 
(m,—my2)e = (n,—n,)a. 


Since both n,,, are positive integers less than p and p is 
a prime, the integers n,—n, and p have highest common 
factor 1 or else ny = nj. Therefore if m, 4 n, there exist 
integers r and s such that (proved in Chap. IV) 


r(n.—n,)+sp = 1. 
Hence since px = 0 for every xe F, 
r(m,—m,)e = (1—sp)a = a. 
It follows that a is of the form ke for some integer k; 
but every element of this form is in the set 


le, 2e, 3e,...,(p—l)e, pe 
and so we have a contradiction. Therefore n, = n, and also 
(m,—mz)e = (n,—n,)a = 0, 


which shows that m, = mz. From this we deduce that no 
two elements of the form me+na (lSmSp, 1<nSp) co- 
incide, so that we have a list of p? distinct elements of F. 
If this exhausts F, the order of F is p”; if not we can repeat 
the process, obtaining a set of p® distinct elements of F and 
can continue until all the elements of F are included. At 
each stage of the process, we have a set of p’ distinct ele- 
ments of F for some positive integer tf; the process stops if 
this exhausts F and continues if it does not exhaust F. 
Since F is finite, the process must stop after a finite number 
of stages. © 


It can be shown that for each prime number p and each 
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positive integer n, there is essentially only one field consist- 
ing of p" elements; this field is denoted by GF(p") and is 
called a Galois field. 


§32. Ideals and factor rings. We conclude this chapter 
by introducing for rings the analogues of normal subgroups 
and factor groups. 

The notion of an ideal is analogous to that of a normal 
subgroup. The situation is made more complicated by the 
fact that we now have two operations to consider, but basic- 
ally the procedure is the same. 

Let S be a subring of a ring R and consider the sets $ 
and R in relation to addition. The set R is an abelian group 
with respect to addition and S is a subgroup of this group. 
Thus S$ is a normal subgroup and every left coset of S coin- 
cides with the corresponding right coset. Since we use the 
additive notation for these groups, we write x+ S for the 
coset of S containing x. As in §17, the equivalence relation 
~ (mod S) defined by x~y (mod S) whenever x, y belong 
to the same coset of S, is stable with respect to addition. 
Therefore, in the set of cosets x +S, we can define the in- 
duced binary operation, which we call addition of cosets. 

Multiplication, however, is less straightforward. For the 
stability of the equivalence relation ~ (mod S) with respect 
to multiplication, we require the following condition (c) to 
be satisfied: 


X,~y; (mod S), x, ~y2 (mod S) > x1x.~y,y2 (mod S). 


We proceed by analogy with the argument used in §17. 
Suppose that x,;~y, (mod S) and x,;~y, (mod S). Then 
X,; = yy +5, and x, = y,+5, where s,,5,¢S. Hence 


X1X2 = VyY2 FY 1S. +512 451582, 
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and so x;X2 = y,;2+s, where s is an element of S, if and 
only if 


V182 FS; 2 +5482 


belongs to S for all y,, y, ¢ R and all s,,5,¢S. Since S is 
a subring, s,s, € S for all s,, 5, ¢ S and so we require 


y152+51y2ES (32.1) 


for all y,,y,¢R and all s,,s,¢S. If we put y, = 0 in 
(32.1) we see that we require 


yy52ES (3222) 


for all y,; e R and all s, eS. Similarly, if we put y, = 0 in 
(32.1), we see that we require 


Siy2ES (3233) 


for all s,; ¢ S and all y, eR. Thus, if (32.1) is satisfied, so 
are (32.2) and (32.3). Conversely, if (32.2) and (32.3) are 
satisfied, so is (32.1). Conditions (32.2) and (32.3) are 
therefore necessary and sufficient for the equivalence rela- 
tion ~ (mod S) to be stable with respect to multiplication. 

This leads to the formulation of the following definition. 
A subset S of a ring R is said to be an ideal if S is a subgroup 
of R with respect to addition and if, for all x € R and all 
sé S, we have 


xseS, sxeS. (32.4) 


It should be noticed that it is not necessary to state that S 
is a subring with these properties because if the properties 
are satisfied, then S is necessarily a subring; for S is a sub- 
group with respect to addition and is closed with respect to 
multiplication by (32.4) with x € S, so that, by the result in 
Exercise 30.1, S is a subring. 
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Exercise 32.1. Prove that the set of even integers is an 
ideal in the ring Z of integers. 


Exercise 32.2. Let R be a ring with a unity and let S be 
a non-empty subset of R such that for allxe Randallse S 
we have xse S and sxe S. If S is closed with respect to 
addition, prove that S is an ideal of R. 


Exercise 32.3. If S is an ideal of a ring R, prove that the 
ring M, ,(S) of matrices over S is an ideal of the ring 
M,,,,(R) of matrices over R. 


Every non-zero ring has two ideals, namely the ring 
itself and the subset consisting of 0 alone. Any ideal which 
is not one of these two is called a proper ideal. 


THEOREM 32.1. A field has no proper ideals. 


Proof. Let S be an ideal in the field F. Suppose that S$ 
contains at least one non-zero element a. Then a has an 
inverse with respect to multiplication and so 


=a‘aeS. 
Therefore, if x is any element of F, we have 


Sea sles 
and hence S = F. © 


In general, a non-zero ring which has no proper ideals 
is called simple. The above theorem shows that a field is 
simple. There exist simple rings which are not fields; for 
example, the ring M,, ,(F) of all square matrices of order n 
with coefficients in a field F is simple. The proof of this is a 
good exercise in the manipulation of matrices, which the 
reader should attempt once familiarity with the technique 
of matrix multiplication has been acquired. 
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If S is an ideal in the ring R the equivalence relation 
(mod S) is stable with respect to both addition and multipli- 
cation; the definition of an ideal was framed with these 
properties in mind. Thus we can introduce the correspond- 
ing binary operations into the set of cosets of S in R: we 
shall call these operations addition and multiplication res- 
pectively. By analogy with Theorem 18.1, we can prove that 
the set of all cosets of S in R is a ring with respect to these 
operations: this is called the factor ring or difference ring 
or quotient ring of R by S and it is denoted by R/S. 


Example 32.1. If Z is the ring of integers and S is the 
ideal consisting of all the even integers, then the factor ring 
Z/S contains just two elements—the set of even integers and 
the set of odd integers. In fact Z/S is isomorphic to Z). 


Just as in the case of groups, there is a connection be- 
tween the concepts of homomorphism and factor ring. The 
kernel and image of a homomorphism can be defined as in 
§20; the image Im f of the homomorphism f: R, > R, is 
the subset of elements of R, of the form f(x) for some 
x € R, and the kernel Ker fis the subset of elements y of Ry; 
such that f(y) = 0. Im fis a subring of R, and Ker fis an 
ideal of R,; the factor ring R,/Ker/f is isomorphic with 
Im f. This result is analogous to Theorem 20.2; the ana- 
logue of Theorem 20.1 is that for any ideal S in a ring R, 
there is a homomorphism (called the natural homomorphism) 
of R onto R/S. 


Exercise 32.4. Let Z be the ring of integers and let S, 
be the subset consisting of all integers divisible by n, where 
n is a positive integer. Prove that S, is an ideal of Z and 
that Z/S, is isomorphic to Z,,. 


Exercise 32.5. If Ris any ring and k is a positive integer, 
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the kth power of R is the ring R* whose elements are all 
finite sums of products of k elements of R. Prove that R* 
is an ideal of R. 


Exercise 32.6. A ring N is said to be nilpotent if there 
exists a positive integer v such that N” = 0 (that is, VN’ con- 
sists of the zero element alone). Prove that if R is any ring, 
then the ring R/R* is nilpotent. 


CHAPTER IV 
POLYNOMIAL AND EUCLIDEAN RINGS 


§33. Introduction. The idea of a polynomial is encoun- 
tered in elementary algebra, but a precise definition is not 
easily found. Literally, the word “‘ polynomial ’’ means 
** consisting of more than one term’’. In particular, a bi- 
nomial consists of two terms and a trinomial consists of 
three terms. A monomial is an expression consisting of a 
single term and in the literal sense it is not a particular case 
of a polynomial. However, mathematicians restrict the 
meaning of the word polynomial to expressions which are 
sums of multiples of positive integral powers of certain un- 
specified elements, and regard a monomial as a special case 
of a polynomial. Thus x? is a polynomial in x and in parti- 
cular is a monomial; x+y is a polynomial in x and y and is 
a binomial; x?+2x—1 is a polynomial in x and is a tri- 


: 1 : ee 
nomial; x +gxy'zt—ey +3 is a polynomial in x, y and z. 


On the other hand, x+sin x is not a polynomial in x, but 
it is a polynomial in x and sin x. Similarly, x+1/x? is not 
a polynomial in x, but it is a polynomial in x and 1/x. 

In these examples, we can regard x, y and z as being 
elements taken from some system in which addition and 
multiplication are defined and in which the coefficients have 


a meaning (for example, the field of real numbers). For the 
103 
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purposes of abstract algebra, however, this point of view is 
inappropriate, because it is too restrictive. Algebraists re- 
place the idea of a polynomial in certain variables by that 
of a polynomial in certain indeterminates: by an indeter- 
minate is meant an object whose nature is not specified but 
which is assumed to obey certain laws. But this is an im- 
precise idea, having similar drawbacks to those encountered 
when complex numbers are defined by introducing a num- 
ber i such that i? = —1 (see §28). In §§34 and 35 we give 
a precise definition of a polynomial in one “ indeterminate ” 
and introduce addition and multiplication of such poly- 
nomials. The ideas are based on the imprecise idea of a 
polynomial as a formal sum 


Pot DX pee +... pee 


and we show how the familiar notation used in this expres- 
sion can be recovered, but with a precise meaning attached 
to x and to the symbol +. Similar definitions can be given 
for polynomials in several indeterminates, but only poly- 
nomials in one indeterminate are considered in this book. 
In §35 we show that with each ring R we can associate 
another ring P(R) whose elements are polynomials. The 
construction is analogous to that described in §23, in which 
we showed that with each ring R we can associate another 
ring M, ,(R) whose elements are matrices. Some simple 
relationships between R and P(R) are discussed. The case 
in which R is a field is of special interest; the ring P(R) is 
then an integral domain which has several similarities to the 
integral domain Z. The idea of a Euclidean ring, which is 
introduced in §37, provides a unifying concept. As shown 
in §§37 and 38, properties connected with division and fac- 
torisation, generalising familiar properties of the integers, 
play an important part in the study of Euclidean rings. In 
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particular, the idea of prime factorisation is considered. 

In §39, we discuss a generalisation of the rings Z, defined 
in §29. If m is a prime number, then, as we have seen, Z,, is 
a field. We generalise this result and in doing so obtain a 
method for constructing a wide variety of fields. This in- 
cludes not only the fields Z, (where p is a prime number) 
but the field C of complex numbers (strictly speaking, we 
define a field isomorphic with C). Thus effectively we give 
a definition of the field of complex numbers which places it 
in a wider setting than the definition given in §28. 

In the construction described in §39, we start with a 
field F and obtain a new field F’ which is a factor ring formed 
from the ring P(F) of polynomials over F. The field F’ con- 
tains a subfield isomorphic with F, as is shown in §40. We 
call F’ a simple extension of F. In particular, C can be 
regarded as a simple extension of R. Intuitively, the pro- 
cedure here can be regarded as the reverse of that discussed 
in §28, where we noted that the complex field contains a 
subfield isomorphic with the real field. 


§34. Polynomials in a single indeterminate. In element- 
ary algebra, an expression of the form 
P(X) = PotPiXt+p2x* +... +P,X", (34.1) 


where Po; P1,P2.--+»Pn are certain numbers, is called a 
polynomial in x; if p, # 0, then this polynomial is said to 
have degree n. Given two polynomials p(x) defined by 
(34.1) and q(x) defined by 


G(X) = Jot Gs X+Gax + «6 +4 nx"; 
we say that their sum is the polynomial 


(Pot+4o)+(Pitgi)xt+(P2t92)x7°+..-, (34.2) 
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the last term being 


pox” ai att = i, 
Qax” Al ite 7, 
(P.+g,)x” if m=n. 


We say that their product is the polynomial 


Podo + (Pod1 +P190)X + (Pod2 +P191 +P290)% + ++ 
tee peg; (34.3) 


the coefficient of x” being expressible in the form 


Podrt+P19r-1 t+ «++ +PrIo 


if we agree to define g, to be zero if s>m and p, to be zero 
if s>n. The expressions (34.2) and (34.3) are obtained 
simply by ordinary addition and multiplication, assuming 
that the commutative, associative and distributive laws hold. 

The above statements should not be regarded as giving 
precise definitions of polynomials and addition and multi- 
plication of polynomials, because the nature of x is ignored. 
If Po, Pis-++>P, and x are all real numbers, then the ex- 
pression on the right in (34.1) is also a real number. This 
leads to the idea of defining a polynomial as a mapping 
which associates with the number x the number p(x) given 
by (34.1). However, as already indicated in §33, this point 
of view is inappropriate in abstract algebra. We therefore 
use a different approach, which in some ways is similar to 
that adopted in §28 in defining complex numbers. 

The main feature of the definition is that it extracts the 
numbers po, Py,-.-,P, aS being the essentials in a poly- 
nomial, just as the strict definition of a complex number 
extracts the “real” and “ imaginary’ parts as being the 
essentials. The forms taken by (34.2) and (34.3) are used as 
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guides in defining addition and multiplication of poly- 
nomials. 

We begin by defining polynomials with real coefficients. 
Het 


{Pr} = (os isos rs Pa) 


be an infinite sequence of real numbers of which all but a 
finite number are zero. (We could consider finite sequences 
instead, but this has certain technical disadvantages.) In the 
set P(R) of all such sequences we define two operations, 
addition and multiplication, as follows. If p = {p,} and 
q = {q,}, we define p+gq to be the sequence {s,}, where 
Sy = Dnt, Thus 


Pa} ade) = 1st ans 


where the symbol + on the left-hand side stands for the 
new operation of addition and the symbol + on the right- 
hand side stands for addition of real numbers. We define 
the product pq to be the sequence {¢,}, where 


n 
ty = ny PRI n-k 
k=0 


It is easily verified (the reader should check each step) 
that the set P(R) is a ring with respect to these operations. 
We call this ring the polynomial ring over the real field and 
we call its elements polynomials over the real field. We call 
the numbers p, the coefficients of the polynomial p = {p,}. 

The zero element of the polynomial ring P(R) is the 
sequence whose members are all zero; in keeping with the 
additive notation we shall denote this polynomial by 0. If 
p = {p,} is any non-zero polynomial, the largest integer n 
for which p, # 0 is called the degree of p. We do not define 


108 ELEMENTARY ABSTRACT ALGEBRA § 34 


the degree of the zero polynomial. If p is of degree n and 
P, = 1, we call p a monic polynomial. 

The polynomial ring P(R) is in fact an integral domain, 
the unity of which is the polynomial e = {e,} defined by 


e=1, e=0 Gap: 


However, a polynomial of degree n does not have an inverse 
with respect to multiplication in P(R) unless 1 = 0, for the 
degree of the product of two non-zero polynomials is the 
sum of their degrees, so that the product cannot be e (which 
has degree 0) if the degree of one of the polynomials is not 
zero. Thus P(R) is not a field. 

In the above definition of polynomial, the idea of an 
indeterminate x was not used. However it is desirable to 
recover the familiar notation for a polynomial used in (34.1), 
but with the difference that x is defined. The reader should 
again compare this situation with that occurring in §28 
when we attached a meaning to the complex number i. 

Given any polynomial p = {p,} and any real number a, 
we shall write ap for the polynomial {«p,}. Thus we define 
a type of ‘* multiplication ” in which the elements involved 
are in different sets; « is a real number and p is a poly- 
nomial. Similar operations are encountered in more general 
circumstances in Chapter V, where we consider vector 
spaces. It is easily shown that if p,q are polynomials and 
a, B are real numbers, then 


a(p+q) = ap+aq 
(«+ f)p = ap+ fp 
a(Bp) = (a)p aa 
a(pq) = (ap)q = p(aq). 
We now consider the polynomial x = {x,} defined by 


X =0, x,=1, x,=0 (n22). 
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Then x? = {u,} where 
uta=1, u,=0 (#2) 
and in general x’ = {v,}, where 
p=) 9, =0 50 + r). 
If p = {p,} is any polynomial of degree k, we can write 


P = (Po: P1; Pr, +++) 
— (Po: 0, 0, oe .)+(0, p;; 0, at )+(0, 0, Po, ee + eae 
= ,(1, 0,0; ...)+7,(, 1,0, ...)449,(0, 0, 1,...) +... 


= PoOtPiXtPox* +... tp. (34.5) 
Thus, if we agree to write py in place of poe, we have 
P = PotDiXt+p2x? + ..- Dx’, (34.6) 


so recovering the more familiar notation for polynomials. 
The symbol + used in (34.6) indicates addition of poly- 
nomials (not addition of real numbers). The symbol x has 
a precise meaning because it represents a certain poly- 
nomial. Thus we not only recover the familiar notation, but 
we account for the symbols used. 

Although it is convenient to replace poe by po as in 
(34.6), this procedure can lead to confusion because one 
symbol is being used to represent two different things. The 
number p, and the polynomial pp = poe are different enti- 
ties. However, for many purposes it does no harm to 
identify a polynomial of the form «ae, where « is a real 
number, with the number « itself. For if Po(R) is the subset 
of all polynomials of the form ae, then, as is easily shown, 
P,(R) is a field and the mapping f: R > P,(R) defined by 


f(a) = ae 


is an isomorphism. The situation is similar to that occurring 
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in the case of complex numbers, where it is customary to 
identify a complex number of the form a+ i0 with the corres- 
ponding real number a (see §28). 


§35. Polynomial rings. We now generalise the definition 
of polynomial given in §34 to cases in which the coefficients 
are not necessarily real numbers. Let R be a ring and let 
P(R) be the set of all infinite sequences 


{pn} = (Pos P1> P2» eee DPasss JF 
where p, € R and all but a finite number of terms in the 
sequence are zero. Let addition and multiplication in P(R) 
be defined as follows: if p,q are the sequences {p,}, {q,} 
respectively, then p+q is the sequence {p,+q,} and pq is the 
sequence {t,}, where 


t= aaa 
r=0 


Then it is easily verified that P(R) is a ring with respect to 
these operations (again the reader is urged to check each 
step). This ring is called the polynomial ring over R and its 
elements are called polynomials over R. Given such a poly- 
nomial p = {p,}, the elements po, p;, p2,... are called the 
coefficients of p. The zero element of P(R) is the poly- 
nomial whose coefficients are all zero. The degree of a non- 
zero polynomial p is the greatest integer n for which the 
coefficient p, is non-zero. 

We now examine some of the connections between R 
and P(R). The first result enables us to regard R as being a 
subring of P(R). 


THEOREM 35.1. The polynomial ring P(R) over a ring R 
contains a subring isomorphic to R. 


Proof. Consider the subset Po(R) consisting of those 
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sequences of the form p = (po, 0,0,...,0,...) which have 
all their members zero except possibly the first. It is easily 
shown that P,(R) is a subring of P(R). If we define a map- 
ping f: R > Po(R) by 


f(Po) = P 


then we have 


S(Po+o) = f(Po) +40): 
I (P90) = f(Po) fo): 


Also fis a one-one mapping of R onto P)(R). Hence fis an 
isomorphism. 


The argument used in the proof of Theorem 35.1 is 
essentially the same as that used in §34 to establish the 
existence of a subset in the ring P(R) isomorphic to the 
field of real numbers. However, in the former case we made 
use of the identity element with respect to multiplication; 
in Theorem 35.1 we are not assuming the existence of such 
an element and hence have expressed the proof in a different 
form. We have shown in fact that the existence of an iden- 
tity is not essential for our argument. 

The next three results give examples of properties which 
** carry over’ from R to P(R). 


THEOREM 35.2. The ring P(R) is commutative if and only 
if R is commutative. 


Proof. Suppose that R is commutative. If p = {p,} and 
q = {q,} are two polynomials in R, then 


pq = {ty} 


where 


, = L Pent = Pon t+P19n-1 + «++ +PnQo 
r 
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and 
qp = {u,}; 


where 
u, = LdPa-r = GoPn*+91Pn-1 + ea! +4nPo- 


Since R is commutative, we have ¢, = u,. Consequently 
Pq = qp and hence P(R) is commutative. 

Conversely, suppose that P(R) is commutative. Then, 
in particular, P)(R) is a commutative subring and so R, 
being isomorphic to Po(R), is also commutative. © 


THEOREM 35.3. The ring P(R) has a unity if and only if 
R has a unity. 


Proof. Suppose that R has a unity 1. If 
C0). 0, . FOR: .); 


then every polynomial p over R satisfies pe = p = ep and 
hence P(R) has a unity e. 

Conversely, suppose that P(R) has a unity e. Suppose 
that 


@ = (25,6), €3; - + > 
Since pe = p = ep for any polynomial p, we have 
Pofo = Po = &oPo 


for any element py € R. Hence R has a unity eg. & 


THEOREM 35.4. The ring P(R) is an integral domain if and 
only if R is an integral domain. 
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Proof. We recall that an integral domain is a non-zero 
commutative ring D with a unity such that if x and y are 
any two non-zero elements of D, then xy # 0. 

Suppose first that R is an integral domain. By Theorem 
35.2, P(R) is commutative and by Theorem 35.3, P(R) has 
aunity. Let p, g be two non-zero elements of P(R). Suppose 
that p has degree n and that q has degree m. Then p, # 0 
and g,, # 0. The polynomial pg has the element p,q, aS one 
of its coefficients. Since R is an integral domain, p,g,, ¥ 0. 
Hence pq # 0. Therefore P(R) is an integral domain. 

Conversely, suppose that P(R) is an integral domain. 
Then, by Theorems 35.2 and 35.3, R is a commutative ring 
with a unity. Let po, qo be two non-zero elements of R. By 
hypothesis, the product pq of the polynomials (po, 0, 0, . . .) 
and (gp, 0,0,...) is non-zero. Since pq is the polynomial 
(Pogo, 9, 0, . ..), it follows that pogo is non-zero. Hence R 
is an integral domain. © 


Let « be any element of R and let p = {p,} be a poly- 
nomial over R. Then we write ap for the polynomial {ap,}. 
This is similar to the definition given in §34 for the real case 
and we can easily verify that 


es = ap+ Bp, 

a(p+q) = ap+aq, 

a(Bp) = (aB)p, es 
a(pq) = (ap)q. 


However, it is not true, in general, that 


a(pq) = p(q). 
This does hold if R is commutative, but it is not necessarily 
true otherwise. Moreover if we write pa for the polynomial 
{p,o} then, in general, px # ap. This is also due to the fact 
that R is not necessarily commutative. 
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If R has a unity 1, then we can carry out the procedure 
described in §34 to recover the familiar notation 


PotPiX+PoX" + «+. +P, X" (35.2) 


for a polynomial of degree n. If R does not have a unity 
then the construction does not work and so we have no 
justification for the use of this notation. In most of the 
remainder of this chapter, we shall be considering cases in 
which R does have a unity and shall use the notation in (35.2). 


Exercise 35.1. If S is an ideal of a ring R, prove that the 
ring P(S) of polynomials over S is an ideal of the ring P(R) 
of polynomials over R. 


Exercise 35.2. Let r be any fixed positive integer. Let 
S, be the subset of all polynomials p = {p,} over R such 
that p, = 0 for k<r—1. Prove that S, is an ideal of the 
ring P(R). 


Exercise 35.3. Let S, (r>0) be defined as in Exercise 
35.2 and suppose that R is the field of real numbers. Let 
S3 be the subset of polynomials p = {p,} in S, such that 
P2290. Prove that S3 is closed with respect to addition and 
is such that for all pe S,, se S, we have pse S; and 
sp € S}. Prove also that S} is not an ideal in S, and hence 
that the result proved in Exercise 32.2 does not necessarily 
hold when R does not contain a unity. 


§36. Power series. Various generalisations of the defin- 
ition of polynomial given in §§34 and 35 are possible. For 
example, polynomials in several indeterminates can be de- 
fined by a similar procedure. In this book, however, we 
shall not consider polynomials in more than one indeter- 
minate. 
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In the definition given in §35 of the polynomial ring 
P(R) over R, we stipulated that the elements of P(R) consist 
of sequences with only a finite number of non-zero terms. 
The reason for this is that the formal expressions of the 
form (34.1), on which the ideas were based, consist of finite 
sums. It can easily be seen, however, that it is still possible 
by our method to define addition and multiplication without 
this restriction. Let S(R) be the set of all infinite sequences 
toe) = (Sens{, So, - + oe S,,---) where sje R. In sS(R), we 
can define addition and multiplication as in §35: that is, 


{Sa} + tt} = {Sut ta} 
{Sn} {tnt = {uns 


n 
where u, = > S,t,-,. With respect to these operations, 
r=0 
S(R) is a ring. We call this the ring of power series over R. 
If R has a unity 1, then S(R) has a unity 
€='(1,0;0,0)....): 
If we write 
ee (OOS Oe) 
and define a{s,} = {as,} as in §34, then we can recover the 
familiar formal expression 
Sp@tS{XHSQX7 + 06. FSX" + 00. 
for a power series, just as in §34 we recovered the familiar 
formal expression for a po ome We can also write the 


power series in the form om s,x" by defining x° =e. It 


should be understood, fromeree that this is just a formal 
notation since the sum of an infinite number of sequences is 
not defined. Thus questions concerning convergence do not 
arise. 
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§37. Euclidean rings. The process of dividing an integer 
a by another (non-zero) integer b and determining the quo- 
tient and remainder is familiar in elementary arithmetic. 
Likewise in elementary algebra we encounter the familiar 
process of dividing a polynomial (with real coefficients) by 
a second polynomial and determining the quotient and re- 
mainder. The similarity between the two processes suggests 
that both the ring Z of integers and the ring P(R) of poly- 
nomials with real coefficients are special cases of a general 
type of system in which the operations of determining 
quotients and remainders are possible. If systems of this 
type are suitably defined, we can generalise the concept of 
factorisation of integers and polynomials: in particular, we 
can generalise the ideas of prime factor and highest common 
factor. 

It turns out that the most suitable definition is as follows. 
Let D be an integral domain. Suppose that there exists a 
mapping g which associates with each non-zero element a 
of D a non-negative integer g(a) satisfying the following: 


(i) g(ab)= g(a) for all non-zero a, be D, 
(ii) corresponding to each pair of elements a, be D, 
where b ¥ 0, there exist elements q, r € D satisfying 


a= bq+r 


and such that either r = 0 or g(r)<g(b). 
Then D is called a Euclidean ring. 


Example 37.1. Z is a Euclidean ring with respect to the 
usual operations of addition and multiplication and the 
mapping g given by g(a) = | a|. To prove this, we first of 
all recall that Z is an integral domain (see §25). Condition 
(i) for a Euclidean ring holds because g(ab) = | ab| and 
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g(a) = | a, so that 
g(ab) = |a| |b] 2 [al 
since |b| = 1. To prove condition (ii), we can use the 
following argument. Suppose that a is any integer and that 
b is any non-zero integer. Consider the set of all integers 
of the form a—bc where c is an integer. Let g be the value 
of c for which | a—be | has as small a value as possible and 
write r = a—bg. Then we have a = bg+r; we must show 
that either r = 0 or |r| < ||. Suppose that this is not 
so; that is,|r| = |b]. Ifr and b have the same sign, then, 
since 
a = b(q+1)+(r—)) 
we have 
|a—b(g+1)| =|r—b| <|r| 


which contradicts the definitions of rand q. If r and b have 
opposite signs, then, since 


a = b(q—1)+(r+b) 
we have 
| a—b(q—1) | =| r+] <r] 


again contradicting the definitions of r and g. Thus either 
r=0Oor[r| <|d]. 


Example 37.2. If F is a field, then the polynomial ring 
P(F)is a Euclidean ring with respect to the usual operations 
of addition and multiplication and the mapping g which is 
defined for each non-zero polynomial a by 


g(a) = degree of a. 
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To prove this, we first verify that P(F) is an integral 
domain. This follows from Theorem 35.4 and the fact that 
a field is an integral domain. Condition (i) for a Euclidean 
ring holds because if a and } are non-zero polynomials, then 
the degree of the product ab is not less than the degree of a. 
To prove condition (ii), we use an argument similar to that 
used in Example 37.1 above. Suppose that a is any poly- 
nomial and that bis any non-zero polynomial. Consider the 
set of all polynomials of the form a—bc, where c is a poly- 
nomial. Let g be a polynomial c such that a—bq has the 
smallest possible degree for polynomials of the form a—bc. 
Thus 


degree of (a—bq) <= degree of (a—bc) 
for all polynomials c. Write r = a—bq. Then we have 
a = bq+r; we must show that either r = 0 or the degree of 
r is less than the degree of b. To prove this, we suppose 


that r # O and that the degree of r is not less than the degree 
of b. Let b, r be given by 


b => bo t+ byx+ eee +bax” 
r mlngaer goes tree 


where 5,,, ', # 0; by hypothesis n=m. (Here we are using 
the traditional notation for a polynomial. As explained in 
§35, this is justifiable.) Then, if 


q =q+rqb,* x" 
we have 


a—bq' = (rotryxt+ ... +7,x")—- 
= (bo peer .. bx r.b ee 


so that a—bq’ is a polynomial whose degree is less than that 
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of r. This contradicts the definition of r. Thus either r = 0 
or the degree of r is less than the degree of b. 

It should be noticed that the argument used above in- 
volves the inverse with respect to multiplication of the co- 
efficient b,, in the polynomial b. Thus it cannot be used if 
we replace the polynomial ring P(F) over the field F by a 
polynomial ring over an integral domain. 


Exercise 37.1. Let G be the set of all complex numbers 
whose real and imaginary parts are integers. Let g be the 
mapping which associates with each such number the square 
of its modulus. Prove that G is a Euclidean ring with respect 
to the usual operations of addition and multiplication (for 
complex numbers) and the mapping g. (The elements of G 
are called Gaussian integers.) 


We next consider the concept of highest common factor. 
Let ay, a, be two non-zero elements of any commutative 
ring R with a unity. Then, if be R is such that 


ay = be 


for some ce R, we call b a factor of ay. If b is a factor of 
both ap and a, then it is called a common factor of a) and 
a,. Suppose that / € R is such that 


(i) h is a common factor of dy and aj, 
(ii) every common factor of ay and a, is a factor of h. 


Then h is called a highest common factor (H.C.F.) of ay) and 
a,. The phrase “ greatest common divisor’ (G.C.D.) is 
sometimes used instead of highest common factor. 


Example 37.3. Any two elements of Z possess a highest 
common factor. For example, the highest common factors 
of 42 and 70 are 14 and —14. 
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Example 37.4. Every non-zero element of a field F is a 
highest common factor of every pair of elements of F. 


THEOREM 37.1. (Euclid’s Algorithm.) Any two non-zero 
elements ao, a, of a Euclidean ring E have a highest common 
factor h such that 


h = uagt+va, 
for some elements u,v € E. 


Proof. Suppose that g(a,)<g(a,). (This clearly does not 
involve any loss of generality.) Then there exist g,,a,¢€E 
such that 


Ao = 4,4, +a2, where a, = 0 or g(a,)<g(a,). 


This follows from the definition of a Euclidean ring. 
Suppose that a, # 0. Then 


a, = q2a,+4a3, where a, = 0 or g(a;)<g(a,), 


again from the definition of Euclidean ring. Effectively 
what we have done is to divide ay by ay, finding the quotient 
and remainder; then to divide a, by ap, finding the quotient 
and remainder. We repeat this process so long as we obtain 
a non-zero remainder at each stage. Thus we have 


Go = 414,442, g(a2)<g(a,) C7 
Q, = 4242+43, g(a3)<g(a2) GTZ) 
Q2z = 4343+44, 8(a4)<g(as) (37.3) 


But the numbers g(a;), g(a), g(a3), ... are positive integers 
and since they form a strictly decreasing sequence, the pro- 
cess must stop after a finite number of stages. Thus there 
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exists a positive integer s such that 
as—-2 = s-1 a,-1+4,; as # 0, g(a,)<g(a,-1) (37.4) 
as~4 = As. (37.5) 


It may happen, of course, that a, = 0, in which case s = 1 
and the system of equations (37.1) to (37.5) reduces to the 
single equation ay = q1Q;. 

From (37.5) we see that a, is a factor of a,_,; then (37.4) 
shows that a, is also a factor of a,_,. Similarly a, is a factor 
of a,_, and continuing the process backwards through the 
set of equations we find that a, is a common factor of ap 
and a,. But (37.1) shows that any common factor of ay and 
a, is a factor of a,; (37.2) shows that any common factor of 
a, and a, is a factor of a3 so that any common factor of ay 
and a, is a factor of a3. Similarly any common factor of ag 
and a, is a factor of a, and continuing the process forwards 
through the set of equations we find that any common factor 
of ao and a, is a factor of a,. It follows that a, is a highest 
common factor of a) and a. 

From (37.1) we get 


G2 = Ggo~ M14 
and then, from (37.2), we get 
a3 = —42%9+(1+9192)41- 


Thus @,,a, are both expressible in the form uay+va,, 
where u,v€ E. By extending this argument we can show 
that each a; is expressible in this form; hence in particular, 
a, is expressible in the form uay+ vay. © 


It should be observed that we do not talk about the 
H.C.F. of ay and a,. In general, there is more than one 
possibility. In the next theorem we show how all the possi- 

EAA I 
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bilities can be determined. Before stating this theorem, it is 
convenient to make the following definition. Let R bea ring 
with a unity e. Then an element x € Ris said to be invertible 
if it has an inverse with respect to multiplication: that is, if 
there exists y such that xy = e = yx. 


THEOREM 37.2. Let a,b be two elements of an integral 
domain D and suppose that h is a highest common factor of a 
and b, Then if s is any invertible element of D, the element sh 
is also a highest common factor of a and b. Conversely, if 
h' is any highest common factor of a and b, then h' can be 
expressed in the form sh where s is an invertible element of D. 


Proof. Since h is a common factor of a and b, we have 
(= qh, b= qzh 


for some q;,q,¢ D. Hence, if s is an invertible element, 
with inverse s~', we can write 


a=4q,s ‘sh, b =q,s~'sh 


and so sh is a common factor of a and b. Since h is an 
H.C.F. of a and b, every common factor of a and b is a 
factor of h. Clearly every factor of fA is also a factor of sh, 
so it follows that sh is an H.C.F. of a and b. 

Now suppose that h’ is any H.C.F. of a and b. Since h 
is an H.C.F. of a and 5, h’ is a factor of h. Similarly h is a 
factor of h’. Therefore there exist elements g3, g, such that 


h=q,h', h' =qyh. 
Hence h’ = (94q3)h’, so that 
(€—qaq3)h’ = 0. 


Since D has no divisors of zero, it follows that g4q, = e and 
since multiplication in D is commutative, this is sufficient to 
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show that q, is invertible. Hence h’ is of the form sh, where 
§ is invertible. © 


In the Euclidean ring Z, there are just two invertible 
elements: the integers 1 and —1. Thus, by Theorems 37.1 
and 37.2, every pair of non-zero elements has precisely two 
highest common factors, which are equal in magnitude but 
opposite in sign. It is usual to call the positive highest 
common factor of a and b the highest common factor of a 
and b. Similarly in the polynomial ring P(F), where F is a 
field, the invertible elements consist of all polynomials of 
degree zero. Thus if two polynomials a, b have H.C.F. h, 
then any H.C.F. of a, b is expressible in the form «h, where 
aeF. Of the polynomials «h there is just one which is a 
monic polynomial: that is, the last non-zero coefficient is 
unity. It is usual to call this the highest common factor of 
the polynomials a and b. 

In order to determine highest common factors in the 
rings Z and P(F), we carry out the repeated division process 
used in the proof of Theorem 37.1. 


Example 37.5. To find the H.C.F. of 588 and 4004, we 
first divide 4004 by 588, obtaining a quotient 6 and re- 
mainder 476: 


4004 = 6 x588 +476 (37.6) 


Next we divide 588 by 476, obtaining a quotient 1 and 
remainder 112: 

588 = 1x476+112. Bia) 
Then we divide 476 by 112, obtaining a quotient 4 and 
remainder 28: 


476 = 4x112+28. (37.8) 
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Finally we have 
11233428; (37.9) 


and so the H.C.F. of 588 and 4004 is 28. To find integers u 
and v such that 


4004u+ 588v = 28, 
we first write (37.6) in the form 
476 = 4004-6 x 588. 
Then, from (37.7), we get 


112 = 588—1 x (4004—6 x 588) 
= 7x 588—1 x 4004 


and from (37.8) we get 


28 = 496—4x%112 
= (4004 —6 x 588)—28 x 588+4 x 4004 
= 5x 4004—34 x 588. 


Exercise 37.2. Find the H.C.F. of the integers 1638 and 
2574. 


Exercise 37.3. Determine polynomials u,, wu, such that 
Pity +P2t2 = @, 


where p, is the polynomial 1+x+x?+.x° and p, is the 
polynomial 2—.x? (with real coefficients in each case). 


We end this section with another result on invertible 
elements, which is required in the next section. 


THEOREM 37.3. Let E be a Euclidean ring and suppose 
that c, dé E are such that 


c= ud 
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for some ue E and 

g(c) = g(d). 
Then u is invertible. 


Proof. It follows from the definition of a Euclidean ring 
that there exist g, ré E such that 


d=qct+r 

where either g(r)<g(c) or r = 0. Since c = ud, we have 

d = qud+r. 
Suppose that r # 0; then g(r)<g(c) and 

g(r) = g(d(e—qu)) 28). 
But since g(d) = g(c), it follows that g(r)=>2(c), which gives 
a contradiction. Hence r = 0 and d = qc. Therefore 
Cgc 

so that ug = e and u is therefore invertible. © 


Exercise 37.4. Prove that if u is an invertible element of 
a Euclidean ring E, then g(u) = g(e), where e is the unity of 
£. Deduce that g(u)< g(x) for all x in E. 


§38. Prime factorisation in Euclidean rings, We recall 
that a positive integer p is said to be a prime number if the 
only positive integers which are factors of p are p and 1. 
Another way of expressing this is to say that an element 
ne Z is prime if the only factors of n are n, —n, 1, —1. 

A similar definition applies to polynomials. We say that 
a polynomial p over a field F is an irreducible polynomial 
over F if the only factors of p are the polynomials of the 
form ap, we where « (40) is an element of F and e is the 
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unity of P(F). A polynomial which is not irreducible is said 
to be reducible. The idea of irreducibility of a polynomial 
corresponds to that of primeness of an integer. 


Example 38.1. If F is the field of rational numbers, the 
polynomial 2—x? over F is irreducible. However, if F is 
the field of real numbers, then the polynomial 2—x? over 
F is reducible, since ,/2—x and ,/2+ x are factors. 


Example 38.2. If Fis the field of real numbers, the poly- 
nomial 1+.x? over Fis irreducible. However if F is the field 
of complex numbers, then the polynomial 1+.x? over F is 
reducible. 


The concepts of prime integer and irreducible poly- 
nomial are generalised to Euclidean rings as follows. Let 
E be a Euclidean ring. Then a non-zero element pe E is 
said to be prime if it is not invertible and cannot be expressed 
as the product of two factors a, b such that g(a)<g(p) and 
g(b)<g(p). 

An alternative way of expressing this definition is given 
by the result proved in the following theorem. 


THEOREM 38.1. A non-invertible element p (40) in a 
Euclidean ring E is prime if and only if the only factors of 
p are 


(i) the invertible elements of E, 
(ii) the elements of E of the form sp, where s is an invert- 
ible element of E. 


Proof. Suppose that p is a prime element of E and that 
uis a factor of p. Then 


p= uq 


for someg e€ E. Since pis a prime, we have either g(u) = g(p) 
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or g(q) = g(p). Hence, by Theorem 37.3, either u is inver- 
tible or g is invertible. If g is invertible, then we have 


u=q"'‘p, 


where q~' is the inverse of g. Since q~! is also invertible, 
it follows that u is of the form sp, where s is invertible. Thus 
either u is invertible, or is of the form sp, where s is invertible. 

Conversely, suppose that p is a non-invertible element of 
E whose only factors are the invertible elements of E and 
the elements of the form sp, where s is invertible. Let a, b 
be elements of E such that p = ab. By hypothesis, a, b are 
either invertible or of the form sp, where s is invertible. If 
a is of the form sp, then we have 


g(p) 2 g(a) 
since p = ab; also 
g(a) 2 g(p) 


since a = sp. Hence g(p) = g(a) in this case. If neither a 
nor b is of the form sp, then both are invertible and so 
a = pb~', which leads at once to a contradiction. © 


Because Theorem 38.1 gives a necessary and sufficient 
condition for p to be a prime element, it enables us to give 
the following alternative definition of primeness: an element 
p «€ Eis prime if it is not invertible and if its only factors are 
of the form s or sp where s is an invertible element of E. 


THEOREM 38.2. Let a,b be two non-zero elements of a 
Euclidean ring E and let p be a prime element of E. Then if p 
is a factor of ab, either p is a factor of a or p is a factor of b. 


Proof. Suppose that p is not a factor of b. Then every 
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highest common factor of p and b is an invertible element, 
for, by Theorem 38.1, the factors of p are the invertible ele- 
ments and the elements of the form sp, where s is invertible; 
but an element of the form sp is not a factor of b, because p 
is not a factor of b. Since every invertible element is a 
factor of e, it follows that e is a highest common factor of 
p and b. Hence, by Theorem 37.1 we have 


e = upt+ub (38.1) 


for some elements u, v € E. Multiplying both sides of (38.1) 
by a, we get 


a = aup+vab 
and since p is a factor of ab it follows that 
a = (au+ovw)p 
for some we E. Therefore p is a factor of a. © 


We now prove one of the central theorems on Euclidean 
rings. This states that every element of a Euclidean ring can 
be expressed in effectively only one way as a product of 
prime factors. This result is a generalisation of the theorem 
that every positive integer can be expressed in one and only 
one way as a product of prime numbers. For polynomials 
the corresponding result is that a monic polynomial can be 
expressed in one and only one way as a product of irre- 
ducible monic polynomials. 


THEOREM 38.3. (Unique factorisation.) Let a (#0) be any 
non-invertible element of a Euclidean ring E. Then we may 
write @ = P\P2.-. Pn» Where py, Pr,..- Py are prime ele- 
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ments. If also 


a = 4192 -++Qm 


where 41,92,-+++%m are prime elements, then m =n and 
there is a permutation 


such that 


91 = S1Piy I2 = S2Di»-++s In = SnPi, 
where 81, 82,...,8, are invertible elements of E. 


Proof. We first show that a can be expressed as the 
product of a finite number of prime elements. This is 
obvious if a is itself a prime. If a is not prime, then there 
exist a, a, such that 


a= a,Q2 


where g(a,)<g(a) and g(a,)<g(a). Neither a, nor a, is 
invertible; for example, if a, were invertible then a, = a; ‘a 
and so g(a,)=g(a), which gives a contradiction. If a,, a, 
are both primes, there is nothing further to prove. If ap, 
say, is not a prime, we can repeat the process expressing a 
in the form 


where a3, a, are non-invertible elements such that 
(43) <8 (az) <g(a) 
8 (44) <2(az)<g(a). 
A similar operation can be carried out if a, is not a prime. 


Thus a can be expressed as the product of three or four 
elements, amongst which there is at least one factor x 
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satisfying g(x)<g(a)—2. If not all the elements are primes, 
we can repeat the process, expressing a as a product of at 
most eight elements, amongst which there is at least one 
element x satisfying g(x) <g(a)—3. 

If we continue in this way, the process eventually stops, 
since each g(x) is a positive integer. Therefore at some stage 
we find that we have expressed a as a product of a finite 
number of prime elements. 

Thus the first part of Theorem 38.3 is proved. Suppose 
now that 


@ = PyP2-+ + Pn = 1192 +++ Im (38.2) 
WHETE'P,, Po, + + «+ Dav G1siQas > «= 9Qm ate primes. "by Geen 
q, is a factor of p,(p,...p,). Since g, is a prime, it follows 
from Theorem 38.2 that qg, is a factor either of p, or of 
P2...p,- Ifq, is a factor of p,, then, by Theorem 38.1,9, 
is of the form s,p, where s, is an invertible element, since 
q; cannot itself be an invertible element. If q, is not a factor 
of p,, then it is a factor of p, ...p,. Anargument similar to 
that just used shows that either g, = 5,p, for some inver- 
tible element s, or q, is a factor of p;...p,. Continuing 
in this way we see that, whatever happens, there is some 
integer i, such that 1<i, <n and 


91 = S;Pi,> 
where s,; is invertible. Then, by (38.2), 


P1P2 +++ Pn = S1Pi,d2 - «+m: 
Hence 


Py ere *Pi,-1Pi,+1 ++ °Pn = 5192 --+dm 
Repeating the above argument, we can show that 


G2 = S§2Pi.» 
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where s, is invertible and 1 <i, <n, i, 4 i,. Continuing the 
process, we can express the remaining q’s in a similar way. 
Clearly n=m, for otherwise we should ultimately obtain 


Ca TSS eels Sng tf) 9) Gs 


which would imply that ¢,41,..-;4m are invertible, con- 
trary to hypothesis. Again m=n, for otherwise we have 


Pmt+i «++ Dy = Sz +++ Sms 


which implies that p,,.,,...,p, are invertible. Therefore 
m=nandq,...,;4, are given by 


Q1 = §1Piys 72 = S2Dign s+ +9 In = SnDins 


Miele Syesa... ., 5, are nvertiole and. 7,,i,,..., 4, are the 
integers 1,2,..., in some order. © 


§39. Residue class rings. We recall that in §29 we dis- 
cussed the ring Z, of residue classes mod n, where n is a 
positive integer. By the result in Exercise 32.4, Z, is iso- 
morphic to a certain factor ring of Z. We shall generalise 
this idea. 

Let R be a commutative ring with a unity. Let a be any 
element of R and let K, be the set of all elements of R which 
have a as a factor. Then K, is an ideal in R. For if c, d are 
elements in K,, then 


c=q,a, d=qza 
for some elements g,, gq, and hence 
e—d = qya—qua = (4, —92)a. 


Thus if c,deK,, then c-de K,. Hence K, is a subgroup 
of R with respect to addition (see Theorem 16.1). Also, 
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given any element ue R, we have 


uc = u(q,a) = (uq,)a, 


so that if ce K, then uce K, for all ue R. Hence K, is an 
ideal in R. We denote the factor ring R/K, by R, and call 
R, the ring of residue classes mod K,. 

Our main theorem in the present section (Theorem 39.1) 
is concerned with the case in which R is a Euclidean ring. 
Before proving this theorem, however, it is worth while con- 
sidering a special case. 

Let P(R) be the polynomial ring over the real field R. 
Let c be the element 1+.x? of P(R) and let K, be the ideal 
consisting of all polynomials over R which have 1+ x? as a 
factor. The elements of the factor ring P.(R) (= P(R)/K,) 
are cosets of K, (see §32). If pe P(R) then we have 


p=qcetr 


where r is either zero or is a polynomial in x of degree 0 or 1. 
Since gc € K,, it follows that p and r belong to the same 
coset of K,. Hence every non-zero coset of K, contains a 
polynomial of degree at most 1. Moreover this polynomial 
is unique, since the difference of two polynomials of degree 1 
cannot have a factor 1 +x? unless the polynomials coincide. 
We write 


[a, b] 


for the coset containing the polynomial a+bx. Then addi- 
tion in P,(R) is given by 


[a, b]+[a’, b'| = [ata’, b+b'] (39.1) 
and multiplication is given by 


[a, b]+[a’, b’] = [aa’—bb', ab’+ba’] (39.2) 
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To prove (39.2), we first multiply a+bx by a'+b'x to get 
aa’ + (ab' + ba’')x+bb'x?. This polynomial can be expressed 
in the form 


aa’ +-(ab’ + ba’)x + bb'(x? +1)—bb’ 


and so is in the same coset as (aa’'— bb’) + (ab’ + ba’)x. 

It follows from (39.1) and (39.2) that the factor ring 
P(R) is isomorphic with the field of complex numbers. In- 
deed equations (39.1) and (39.2) are effectively the same as 
equations (28.1) and (28.2). Thus an isomorphism between 
PR) and C is easily established. (The reader is recom- 
mended to verify each step, even though the details are 
trivial. The correspondence [a, b] > (a, b) must be shown 
to be a one-one homomorphic mapping of P,(R) onto C.) 

In this example, we have shown that the factor ring is a 
field. Another example of this kind was considered in §29, 
where it was shown that the ring of residue classes Z, is a 
field whenever v is a prime number. The following theorem 
unifies and generalises these results. 


THEOREM 39.1. If E is a Euclidean ring and p is a prime 
element of E, then the factor ring E, (= E/K,) is a field. 


Proof. Since E is a commutative ring with a unity, then 
E, is also a commutative ring with a unity. For if [a] 
denotes the coset of K, containing a, then 


[a] [6] = [ad], 

[6] [2] = [ba] = [2] 
and if e is the unity of E, then 

[2] [e] = [ae] = [e]. 


To prove that E, is a field, it is therefore sufficient to show 
that every non-zero element of E, is invertible. 
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Suppose that [a] is a non-zero element of E,. Then a 
does not have p as a factor. Hence, using the same argument 
as at the beginning of the proof of Theorem 38.2, the element 
e is a highest common factor of a and p, so that 
e = up+va 
for some u, ve E. Therefore 
[va] = [e—up], 
so that 
[>] [a] = [e] 
since up € K,. Hence [a] is invertible. © 
Exercise 39.1. Prove that there are just two distinct 
irreducible polynomials of degree 3 over the field Z,. De- 


noting these by p and gq, prove that the fields P(Z,)/K, and 
P(Z,)/K, each have eight elements and are isomorphic. 


§40. Algebraic extensions of a field. We now consider 
the particular case of Theorem 39.1 when E is the poly- 
nomial ring P(F), where F is a field. We first prove the 
following result concerning P,(F) (= P(F)/ K,). 


THEOREM 40.1. Let p € P(F) be an irreducible polynomial. 
Then the field P,(F) contains a sub-field isomorphic with F. 


Proof. Let a be any element of P(F). Since 
a = pqtr, 


where either r = 0 or r has lower degree than p, every coset 
of K, in P(F) can be expressed in the form 


[r] = [rotryx+ cee Lp ee taal: 


where k is the degree of p. Two such cosets [r] and 
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[r’] = DProtrixt ... +7r,_,2*71] are the same if and only 
if r; = r; for all i= 0,1,...,k—1, since the difference of 
two polynomials of degree less than kK—1 cannot have a 
factor of degree k unless the polynomials are equal. 

Let Fo be the set of all elements of P,(F) of the form 
[r], where r is either zero or has degree 0. Then [r]=[Fro]. 
Let f: Fo > F be the mapping defined by 


flr] = ro. 
Then it is easily shown that fis an isomorphism and so Fo 
is a subfield of P,(F) isomorphic to F. © 


Because of the result proved in Theorem 40.1, we call 
the field P,(F) a simple algebraic extension of the field F. 
If p has degree 1, then the argument used in the proof of 
Theorem 40.1 shows that P,(F) is isomorphic with F, 
because in this case Fy = P,(F). Thus the extension is 
trivial because the new field is essentially the same as F. If 
p has degree 2 or more, then P,(F) has a proper subfield 
essentially the same as F. 


Example 40.1. Let P(R) be the polynomial ring over the 
real field and let c be the element 1+.x? of P(R). Then c is 
irreducible and so P.(R) (= P(R)/K,) is a field. This field, 
which is isomorphic with the field C of complex numbers 
defined earlier, is a (non-trivial) simple algebraic extension 
of R. 


Example 40.2. Let P(Q) be the polynomial ring over the 
rational field Q and let d be the element 2—x? of P(Q). 
Then d is irreducible in Q and so P,;(Q) (= P(Q)/K,) is a 
field. The elements of this field are the cosets of Ky in 
P(Q); since d has degree 2, they can all be expressed in 
the form 

[a,b], 
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where [a, b] denotes the coset containing the polynomial 
a+bx. Two such elements [a, 6] and [a’, b’] are equal if and 
only if a = a’ and b = b’. Addition and multiplication in 
P,(Q) are given by 


[a1, b:]+[a2, b2] = [a, +@2, b, +52], (40.1) 
Lay, bi ][a2, 62] = (aya, +2b,b2, ayb, +544]. (40.2) 
The second of these equations holds because 


(a, +,x) (a, +b2x) = aya, + (a,b, +b,a2)x + by byx? 
=> (a,a, +2b,b.)+(a,b, +b,a,)x —b,b,(2—x?). 


From (40.2) it follows that 
(Opt) =n 0t-—ace (40.3) 


where e is the unity of P,(Q). Under the isomorphism 
J: Fo — F defined in Theorem 40.1, the element e of P,(F) 
corresponds to the unity of F. Thus in the present case it 
is natural to identify e with the rational number 1; when we 
do this we identify 2e with 2. Then (40.3) can be written 


(iGei]2 92 


so that it is natural to write v2 for [0, 1]. When we do 
this, we get 


i] 


[a, b] = [a, 0]+[0, 5] 
ae+b[0, 1] 


= at+b,/2. 


In other words, we can regard the elements of P,(Q) as 
being real numbers of the form a+5,/2, where a, b are 
rational. Another way of expressing this is to say that 
P,(Q) is isomorphic with the subfield of the real field 
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consisting of the numbers a+b,/2, where a and 5b are 
rational. 

If this case is compared with that of Example 40.1, it 
will be seen that they have a great deal in common. 


Let F, be any field and let F, be a simple algebraic 
extension of F,, with respect to some irreducible poly- 
nomial p, over F,. Then, by considering some irreducible 
polynomial p, over F,, we can construct a simple algebraic 
extension F of F,. Clearly it may be possible to continue 
this process indefinitely to obtain a sequence of fields 

1 ee ee gl 


meee 


each of which is a simple algebraic extension of its immediate 
predecessor. Any field which can be obtained from a given 
field F by means of a finite number of simple algebraic 
extensions is called an algebraic extension of F. Thus in the 
sequence F,, F,,...,F,,... defined above, each field F, 
is an algebraic extension of each of its predecessors. For 
convenience we also regard a field as an algebraic extension 
of F if it is isomorphic to an algebraic extension of F or 
if it is an algebraic extension of a field isomorphic to F. 


Example 40.3. Let F, be the field of rational numbers. 
The polynomial 2—x? in P(F,) is irreducible over F, and 
so we can construct the appropriate simple algebraic exten- 
sion F, of F;. It can be shown that the polynomial 3—.x? 
in P(F,) is irreducible over F, and so we can construct the 
appropriate simple algebraic extension F; of F,. Similarly 
it can be shown that the polynomial 5—x? in P(F3) is irre- 
ducible over F, and so we can construct the appropriate 
simple algebraic extension F, of F;. We can continue this 
process indefinitely. In general, it can be shown that the 

EAA K 
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polynomial p,—x? in P(F,), where p, is the rth prime, is 
irreducible over F, and so we can construct an algebraic 
extension F.,, of F,. Since the number of primes is infinite, 
the sequence F,, F,, F;,...,F,,... does not terminate. 
Each member is an algebraic extension of its predecessors 
and clearly F, is properly contained in F,,, since the irre- 
ducible polynomial used at each stage has degree greater 
than 1. 


§41. Algebraically closed fields. It has already been 
pointed out that if F’ is a simple algebraic extension of a 
field F by means of a polynomial of degree 1, then F’ is 
isomorphic to F. Therefore, if F is such that there are no 
irreducible polynomials in P(F) having degree greater than 
1, it follows that there are no simple algebraic extensions 
of F other than fields isomorphic to F. A field F such that 
there are no irreducible polynomials in P(F) having degree 
greater than 1 is said to be algebraically closed. 

If F is algebraically closed, then every polynomial over 
F of degree n can be expressed as the product of n poly- 
nomials of degree 1. To prove this it is only necessary to 
observe that any reducible polynomial can be expressed as 
the product of two polynomials each of lower degree, 
according to the definition of reducibility. 


Example 41.1. The real field R is not algebraically closed, 
because the polynomial 1 +x? over R is irreducible. 


Example 41.2. The complex field C is algebraically 
closed. This follows from what is called the ‘* fundamental 
theorem of algebra”’, which states that every polynomial 
over C has a factor of degree one. Hence the only prime 
elements of P(C) are the polynomials of degree one. For 
proofs of the fundamental theorem, the reader should con- 
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sult Theory of Equations by H. W. Turnbull, p. 56, also 
Functions of a Complex Variable by E. G. Phillips, p. 113. 


Exercise 41.1. Determine all the reducible monic poly- 
nomials of degree 2 over the field Z,. Prove that Z, is not 
algebraically closed. 


§42. Roots of polynomials. Let p be a polynomial over 
a field F. Then we can write 


P= DotDiXtPoxX7? + «2. DX", (42.1) 


Where Po, P1, P2,---+>P, are the coefficients of p. If « is an 
element of F such that 


P(®) = Potpyetprw7 +... +p," =0, (42.2) 


then we call « a root of the polynomial p. It is important to 
notice that the expressions potp,;x+... +p,x" in (42.1) 
and po+p,0+p,07+ ... +p,” in (42.2) mean quite differ- 
ent things. The first is a polynomial and the second is an 
element of F. In (42.1) the symbol + stands for addition 
in the polynomial ring P(F); in (42.2) it stands for addition 
in F. Furthermore pg in (42.1) really means poe, where e is 
the identity of P(F), but in (42.2) po stands for an element 
of F, 

A given polynomial p need not havea root. For example, 
if F is the real field, then 1+ x? has no roots, because there 
is no real number «@ such that 1+a? = 0. 

Suppose that a, b are two polynomials over F and that 
a is an element of F. From the definition of addition and 
multiplication of polynomials, it is easily verified that 


(a+b) (a) = a(a)+b(a), (42.3) 
(ab) (a) = a(a) b(«). (42.4) 
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It should be observed that the commutativity of multipli- 
cation in F is used in (42.4). We use these formulae in 
proving the next theorem, which is sometimes known as 
the Remainder Theorem. 


THEOREM 42.1. Let p be a polynomial over a field F and 
let «—x be a polynomial of degree 1 over F. Then there 
exists a polynomial q such that 


p = («—x)q+p(a)e. 
(In less strict terms, the remainder on division of p by 
a—x is p(a).) 
Proof. Since the polynomial ring over F is a Euclidean 
ring, there exist polynomials g, r such that 
p = (a—x)qtr. (42.5) 


Since «—x has degree 1, the polynomial r is either zero or 
has degree 0. Hence we can write r = roe, where ro € F. 
Using the properties expressed in (42.3) and (42.4), we see 
that 


P(e) = (4—a)q(a) +o 
and hence rpg = p(«). Thus 


p = (a—x)q+p(ae. © 


It should be noticed that, in this proof, we have not said 
“put x = a in (42.5). According to our strict definition 
of polynomial, this statement is meaningless. 


THEOREM 42.2. An element a € F is a root of a polynomial 
p over F if and only if «—x is a factor of p. 


Proof. Suppose that a is a root of p. Then p(w) = 0 
and so, by Theorem 42.1, p = (a—x)q; hence p has a factor 
a—x. 
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Suppose conversely that p has a factor «—x. Then, by 
Theorem 42.1, p(a)e can be expressed in the form 


pae = (a—x)q’ 


for some polynomial q’. If g’ were not zero we would have 
the impossible situation of a polynomial of degree zero, 
namely p(aje, being equal to the product of two non-zero 
polynomials one of which is of degree one. We conclude 
that q’ = 0 and consequently that p(«) = 0. © 


THEOREM 42.3. Let p € P(F) be a polynomial of degree n. 
Then p has at most n distinct roots. 


Proof. Suppose that p has n distinct roots a1, @2,..., Oq- 
By Theorem 42.2, «,— x is a factor of p. Hence p can be 
expressed in the form («,—x)q. Since p has degree n, g has 
degree n—1. By Theorem 42.2, «,— x is a factor of p. Since 
a,—x is irreducible, it follows from Theorem 38.2 that 
a,—x is a factor of «,—x or of g. But «,—a, # 0 and so, 
by Theorem 42.2, «,— x is not a factor of «,—x. Hence 
%,—x is a factor of g. Therefore p = (a, — x) («,— x)q’ for 
some polynomial q’ of degreen—2. By Theorem 42.2, a,—x 
is a factor of p. Since «,— x is irreducible, it follows from 
Theorem 38.2 that a,;—x is a factor of (a, —x) (a,—x) or 
of g’. But (a,-—a3) (4,—a«3) # 0 and so, by Theorem 42.2, 
a,—x is not a factor of (#,— x) (#,—x). Hence «,—x is a 
factor of gq’. Continuing in this way, we see that 
p = (a,— x) (4,—x)...(&,—X)Go where gy has degree zero. 

Now suppose that p has another root « different from 
01, %>,...,4,. By the argument used above, we can show 
that «—x is a factor of gg. But go has degree zero, and so 
we have a contradiction. Hence p cannot have more than 
n distinct roots. © 
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By using the following theorem, we can sometimes test 
a polynomial of degree 2 or 3 for irreducibility. 


THEOREM 42.4, Let p be a polynomial over a field F and 
suppose that p has degree 2 or 3. Then p is irreducible if and 
only if it has no roots in F. 


Proof. If p is irreducible, then it does not have a factor 
of degree I and so, by Theorem 42.2, it has no root @ in F. 
Conversely, if p has no root « in F, then, by Theorem 
42.2, p has no factor of degree 1; hence, since the degree of 
p is 2 or 3, p is irreducible. © 


CHAPTER V 
VECTOR SPACES 


§43. Introduction. The idea of a vector space features 
prominently in many branches of modern mathematics. 
The word “vector”? has a wider meaning to the mathe- 
matician than to the physicist, but physical (or geometrical) 
vectors, provided that they are properly defined, provide an 
excellent illustration of the mathematical idea of a vector. 
We therefore begin this chapter with a résumé of the essen- 
tial definitions and properties of such vectors. There are 
many mathematical systems in which two operations are 
defined satisfying properties similar to those of addition and 
scalar multiplication of geometrical vectors (by scalar multi- 
plication we mean the operation of forming the product of 
a real number and a vector; this must not be confused with 
the operation of forming the scalar product of two vectors). 
In defining a vector space, we collect such systems under 
one heading by selecting for our basic assumptions those 
common properties which seem to be the most character- 
istic. This gives us another instance of unification and 
generalisation. As in other cases, the best method of carry- 
ing out the generalisation has been determined largely by 
experience. 

In defining a vector space, we concentrate on addition 
and scalar multiplication. In the general definition, the 

143 


144 ELEMENTARY ABSTRACT ALGEBRA § 44 


operations of forming scalar products and vector products 
of geometrical vectors are not generalised. However both 
types of operation are important in generalisations to vector 
spaces which have some additional structure. In the case 
of the scalar product, the generalisation leads to the idea of 
a vector space having an inner product and the more general 
normed vector spaces, which play an important part in 
modern mathematical analysis. Such vector spaces are dis- 
cussed in §49 and §50. In the case of the vector product, the 
generalisation leads to the idea of a linear algebra, which 
includes in particular the cases of a Lie algebra and an 
associative algebra. We give the relevant definitions in 
§§51 and 52. 


§44, Geometrical vectors. The three-dimensional vectors 
of physics and geometry are no doubt familiar to many 
readers. We shall call such vectors geometrical vectors to 
distinguish them from the generalisations which we shall 
presently describe. The brief description of geometrical 
vectors which follows in this section and in §45 is not in- 
tended in any way to be exhaustive in its treatment, but is 
included for the benefit of readers to whom this theory may 
be unfamiliar. Our object is to draw attention to those 
properties which we intend to generalise to more abstract 
algebraic structures. 

Many physical entities such as mass or temperature are 
conveniently characterised in terms of a real number scale. 
Thus, when we are told that the temperature is 16° C, then 
the temperature is characterised by the number 16. Other 
physical entities such as the velocity of a particle or the 
moment of a couple possess both a magnitude and a direc- 
tion and are not fully determined unless both magnitude and 
direction are specified. For instance, if only the magnitude 
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of the moment of a couple is known we are still in doubt 
about the direction in space in which the couple is acting. 
By a scalar we shall mean an entity determined by a single 
real number and by a vector we shall mean an entity deter- 
mined by both a positive real number, measuring magnitude, 
and a direction in space. 

Since the concept of direction is a geometrical one, it is 
natural to represent a vector geometrically. Thus a given 
vector may be represented by any line segment whose direc- 
tion is that of the given vector and whose length is the 
magnitude of the vector measured in some agreed scale. It 
must be clearly understood that we regard north and south 
as different directions. In view of this it is insufficient to say 


Fic. 1 


that a certain vector is represented by the line segment join- 
ing two points A and B; it is also necessary to indicate, 
usually by an arrow, whether the direction of the vector is 


from A to B or from B to A. In the former case we may 
— 
denote the representative of the vector by AB and in the 
—> 
latter case by BA. In representing a given vector diagram- 


matically any point in space may be chosen as its “ tail”. 
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Once this point has been fixed, the point representing its 
“head ’’ is determined uniquely. Thus, in the parallelogram 
+ se — 
of Fig. 1, AB and DC represent the same vector but AD 
—> 
and CB represent different vectors since they have different 
(in fact opposite) directions. 

We draw attention to the fact that we have not stated 
that entities such as force and angular velocity are vectors. 
These quantities are specified not only by a magnitude and 
direction but also by a line of action. Such entities are 
sometimes called localised vectors to distinguish them from 


the free vectors which we are now investigating. It is clear 
— 
that a force represented by AB in figure 1 would have a 


— 
different effect on a rigid body from one represented by DC. 


Likewise an angular velocity of ten revolutions per second 
—> 


about the axis AB would describe a different motion from 


an angular velocity of ten revolutions per second about the 
—. 


axis DC. A localised vector may be specified by two vectors, 
one of which determines its magnitude and direction while 
the other determines the position, relative to an agreed 
origin, of some point on the line of action. We shall not 
consider localised vectors further in this book. 


§45. Addition of geometrical vectors. In representing a 
vector geometrically we have denoted a representative by 
two letters, indicating the end points, and an arrow, indi- 
cating which is the tail and which is the head. We shall 
denote vectors themselves by single letters such as a. Most 
text books on geometrical vectors use bold type to indicate 
vectors and italic type to indicate scalars. With generalisa- 
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tions in view we prefer to use ordinary lower-case italic to 
indicate vectors and lower case Greek letters for scalars. 
Thus, we might write a to denote the vector represented by 


es 
the directed line segment PQ. It should be remembered, 
-> 
however, that PQ is not the same as the vector a, but merely 
represents a. 
We proceed now to define the addition of two vectors 


— —>- 
aand b. If ais represented by PQ and b represented by OR, 


— 
then a+b is defined to be the vector represented by PR. An 
alternative formulation of the law of addition is the follow- 


— — 
ing. If a is represented by PQ and 5 is represented by PS, 


NES 
then a+b is the vector represented by PR, where R is the 
point such that PORS is a parallelogram. The equivalence 
of these two definitions may be readily confirmed by the 
reader. He may also verify by elementary geometry that 
addition as defined above is both commutative and associa- 
tive. Indeed geometrical vectors form an abelian group with 
respect to addition; if o denotes the vector of zero magni- 
tude (this has indeterminate direction and is conventionally 


— 
represented by PP, where P is any point) then, for all a, 
o+a=a+o=4, 


which shows that o is an identity element for addition. 


ons 

Further, if a is represented by PQ, then a has an additive 
te 

inverse —a represented by QP, since a+(—a) is then repre- 
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sented by PP which represents the zero vector o. In other 

words, 


a+(—a) =o. 


The vector a+a is evidently a vector in the same direc- 
tion as a but of twice its magnitude. It is natural to denote 
this sum by 2a. This introduces a new concept—that of 
multiplication of a vector by a scalar, the resulting product 
being a vector. To be more precise we make the following 
definition. If A is a positive real number, then Ja is the 
vector in the same direction as a but of A times its magni- 
tude. If Ais a negative real number we define Aa = (—1)(—a). 
We also define 0a and Ao to be the zero vector 0. For instance 
(—2)a is a vector in the direction of —a and of twice its 
magnitude. This is, of course, the same as —(2a). Indeed, 
if A, 4p denote real scalars and a, b denote vectors, the follow- 
ing laws are easily verified. 


la = a, 
A(ua) = (Ap)a = pa), 
(A+pa = dat pa, 
A(a+b) = Aat+Ab. 


It should be noticed that multiplication by a scalar in 
the case of geometrical vectors has the same meaning as 
multiplication by a real number. The term “scalar ’”’ is 
used because of physical connections. 


(45.1) 


The following theorem is of fundamental importance. 


THEOREM 45.1. If a, b, c denote vectors whose directions 
are not coplanar and if v is any given vector, then unique 
scalars A, pt, v can be found such that 


v = da+pb+ve. 
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Proof. We consider first the case in which no three of 
—_— 


the vectors a, b, c, v have coplanar directions. Suppose PP’ 
represents the vector v. As shown in Fig. 2, we can con- 
struct a parallelepiped each of whose edges is parallel to 


z 


Fic. 2 


one of the given vectors a, b, c (for instance PA and A’P’ 


are parallel to a) and such that PP’ is a diagonal. Then v 
_> > —: 


is the sum of the vectors represented by PA, AC’ and C’P’, 
which is the same as the sum of the uaoees represented by 
=> OO > 


PA, PB, PC. Since PA is parallel to a, PA must represent 


—> — 
a vector of the form Ja and similarly PB and PC must repre- 
sent vectors of the forms pb and ve respectively. Thus, 


v = dat+pb+ve 
for suitable choices of A, pL, v 
Suppose that we have 
v = datpb+ve = Ayat+pyb+yWX0. 
Then 
(A—A,)a = (44 -Wb+(1—v)c. 
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Hy 7H vires 
E b 
(a4) +(25')« 


But the latter contradicts our assumption that the direction 
of a is not coplanar with the directions of b and c. Hence 
A = A, and similarly p = p, and v = v,. Thus for a given 
vector v the scalars A, zp and v are unique. 

Thus the theorem is proved in the case in which no three 
of a, b, c, v have coplanar directions. In the remaining case, 
we may, without loss of generality, suppose that the direc- 
tions of v, a, b are coplanar. The theorem then follows if 
we choose v = 0 and use the two-dimensional analogue of 
Theorem 45.1, namely the following result. © 


THEOREM 45.2. If a,b denote vectors whose directions 
are not parallel and v is any given vector whose direction is 
coplanar with those of a and b then unique scalars A, jt can 
be found such that v = Aa+ pb. 


The proof in this case 1s left as an exercise to the reader, 
who will naturally resort to a parallelogram in his construc- 
tion in place of the parallelepiped used above. It should be 
pointed out, however, that here too there is a special case 
to be considered, namely the one-dimensional case in which 
v is parallel to one of a, b and c. This case has, in fact, 
already been used in the proof of Theorem 45.1. 

A set of geometrical vectors a, b,...,e,... is said to 
be a linearly dependent set if scalars «, B,...,¢,... not all 
zero, can be found such thatt 


aat+fpb+...+8e+...=0. 
ay Jae WS Bal GD noon lone ao then we require all but a 
finite number of the scalars 3 Bixee . to be zero, in order that 


wa--Bb+ ... +&e-+ ... should be eidebiel 
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The vectors a,b,...,e,... are said to be independent if 
they do not form a dependent set. Trivially, the zero vector 
forms a dependent set, since Jo = o and for a similar reason 
any set containing the zero vector is a dependent set. On the 
other hand no single non-zero vector is a dependent set. If 
two non-zero vectors a, b form a dependent set, then real 
numbers «, 8, not both zero, exist such that 


aa+ pb = o. 


Since «, 8 are not both zero, it is clear that neither of them 
vanishes and we can write 


a = (—B/a)b, 


which shows that a and b are parallel. 
If three non-zero vectors a, b, c form a dependent set, 
then «, 8, y exist such that a, B, y are not all zero and 


aa+Bb+ yc = a. 
If « # 0, then 
a = (—f/a)b+(—}/a)c, 


showing that the directions of a, b, c are coplanar. If, how- 
ever, « = O, then, as has just been shown, b and c are paral- 
lel, so that once again a, b, c have coplanar directions. 

We next show that any four vectors a,b, c,d form a 
dependent set. That is to say, given any four vectors 
a, b, c, d we shall show that it is possible to find scalars 
a, B, y, 6, not all zero, such that 


aa+Bb+yc+dd = o. 


If 6, c, d are not coplanar, then by Theorem 45.1, scalars 
A, H, vy can be found such that 


a = Ab+ypc+vd. 
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Choosing « = 1, B = —A, y = —y, 6 = —v, we obtain 
the desired result. If b,c, d are coplanar but c, d are not 
parallel then, by Theorem 45.2, we have 


b=Actia 


for some A, 4. In this case, we choose a=0, 8 = 1, 
y = —Aand 6 = —y. If c,d are parallel but d ¥ 0, then 
c = Ad for some A. In this case we choose « = 0 = f, 
y= 1 and 6= —/. Finally if d= 0, then we choose 
g=0=—) = 4 and om, 

The relevance of linear dependence to the geometrical 
concept of dimension is exhibited by the following. Suppose 
that the origin is chosen as the tail of the representative of 
each vector under consideration. If one vector forms a de- 
pendent set, this is the zero vector and the representative lies 
in a space of zero dimension, namely a point. If two vectors 
form a dependent set, their representatives lie on a space of 
one dimension, namely a line. If three vectors form a 
dependent set, their representatives lie in a two-dimensional 
space, namely a plane. The representatives of any four 
vectors lie in a three-dimensional space. 

Suppose that e,, e2, e; denote three vectors whose direc- 
tions are non-coplanar; then according to Theorem 45.1, for 
any vector a we can find scalars «,, «2, #3; such that 


a= Hey + &2€2+03e3. 


The scalars «,, «), a; may be called the coordinates or com- 
ponents of a relative to the basis e,, e,, e; and it is convenient 
to arrange these coordinates in an ordered triplet [a,, «3, &3 ]. 
Provided that we know which basis is being employed, the 
triplet [a,, «,, «;] defines the vector a uniquely. If we sup- 
pose that a particular basis has been specified, then we may 
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a= [a,, 02,03], b = [B,, Bo, B3). 

Then 

at+b = [a, +B, a.+B2, ¢3+f3], (45.2) 
since 

(We, +02€2 +%3€3)+ (Be; + Bre. + B3e3) 

= (0, +81 )e1+(42+f2)e2 + (a3 + Bs)e3. 

Similarly we can show that 
da = [Aa,, Aa, Aa3] (45.3) 

and furthermore that, still relative to the basis e;, e2, é3, 


é&; = Be 0, 0], 
6, = (051, 0); 
ey = 110)0, 11. 


I 


It must be emphasised that had a different basis been 
chosen, then a given vector a would in general be repre- 
sented by a different triplet of coordinates. 

All the properties of addition of vectors and of multipli- 
cation of a vector by a scalar may be deduced from (45.2) 
and (45.3). If we were to begin afresh by defining a vector 
as an ordered triplet of scalars and defining addition and 
multiplication by a scalar by means of (45.2) and (45.3), we 
would recover many of the results such as Theorem 45.1 
(which would then be an exercise in the use of linear 
equations). However, the resulting theory would have 
something less than the theory of geometrical vectors unless 
concepts such as direction and magnitude could be incor- 
porated. We shall consider such questions in §49. 

EAA L 
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§46. Vector spaces. The geometrical vectors of the pre- 
ceding sections prepare us for the following definition. 


A vector space V over a field F is a set of elements called 
vectors in which the addition of two vectors and the multi- 
plication of a vector by an element of F are defined such that 


(i) the elements of V form an additive abelian group; 
(ii) the formulae (45.1) hold for all vectors a, b of V and 
for all elements A, p of F. 


Thus a vector space V over a field F is a set together 
with two operations, addition and scalar multiplication, 
such that 


(1) for all x, yin V, x+y is defined and is in V, 
(2) x+y = y+<x for all x, y in V, 

(3) (x+y)+z = x+(y+2z) for all x, y, z in V, 
(4) there is an element o € V such that 


x+o =x for allxeV, 


(5) given xe V, there exists an element —x such that 
x+(-—x) = 0, 
(6) for all x e V and ae F, ax is defined and is in V, 
(7) («+ f)x = ax+ Bx for all x e V and all a, Be F, 
(8) a(x+y) = ax+ay for all x, ye VandallaeF, 
(9) «(Bx) = (af)x for all x e V and all a, Be F, 
(10) 1x = x for all x € V, where 1 denotes the unity of F. 


We may also denote this vector space by V(F) if it is 
considered advantageous to exhibit the field F. Extending 
our previous convention, we employ Greek letters to denote 
elements of the field F and small italic letters to denote 


elements of V. As before, we use the term scalar for an 
element of F. 
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Clearly the set of all geometrical vectors is an example 
of a vector space, but some of the other systems which have 
already been studied provide further examples. 


Example 46.1. The polynomial ring P(F) is a vector 
space over the same field F in respect of the usual definitions 
of addition of polynomials and of multiplication of a poly- 
nomial by a scalar of F. The polynomials are in this case 
the vectors of this vector space. 


Example 46.2. The set of all 2 x 2 matrices with elements 
from a field F is a vector space with respect to addition 
defined as in §23 and scalar multiplication defined by 


A | eta 12 = Ady, Ady, 
21 %22 Aca, Adz? 
Observe in this case that if e,;, €,2, 21,22 denote the 
matrices 


foo} [ooh lio} lor] 


respectively, then any vector a of this space, that is any 2 x2 


matrix 
a a 
ee 11 %12 
Oa1 X22 


with elements from F, can be written in the form 
A = yyy HO y 2 yo +1024 22022. 


The matrices e131, €12, €21, €22 form a “basis” for this 
vector space, in the same sense as the three non-coplanar 
geometrical vectors e,, €,,e3 in §45 form a basis for the 
space of geometrical vectors. 


156 ELEMENTARY ABSTRACT ALGEBRA § 46 


Example 46.3. More generally, the set of all m xn mat- 
rices with elements from F (where m, n are fixed positive 
integers) is a vector space with respect to operations defined 
as in Example 46.2. 


Example 46.4. The set of all real functions of x which 
vanish at x = 0 and at x = / and which are continuous in 
the interval 0<x</ form a vector space over the real field 
R with respect to the usual definitions of addition and of 
multiplication by a real number. 


Example 46.5. An obvious generalisation of the ordered 
triplets derived from geometrical vectors is an important 
illustration of a vector space, namely the set, denoted by F", 
of all ordered n-tuples 


a= [;;.0 oll 


of elements from a field F. In this vector space we define 


a+b= [a,,...90,]°16,,...,8,] = [eve .., oe 
da = Fits, sins ©) Sule, «- cada. 


Example 46.6. The set C of complex numbers is a vector 
space over the real field R with respect to addition of com- 
plex numbers and multiplication of complex numbers by 
real numbers. 


Example 46.7. The set C of complex numbers is a vector 
space over the complex field C with respect to addition and 
multiplication of complex numbers. 


The following results are easily deduced from the defi- 
nition of a vector space. 
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THEOREM 46.1. In any vector space V(F), the following 
properties hold: 


(i) if x is any vector, then 0x = 0, 
(ii) if x is any vector, then (—1)x = —x, 
(ili) if ae F, x e V(F) and ax = 0, then either « = 0 or 
x =20; 


Proof. By (10) and (7), we have 
x+0x = Ie+0x = (140)x = lx = x. 
Also, by (7), 


Ox = (0+0)x 
= 0x+0x, 
so that by (4), 
Ox — 0. 


Similarly 
(—1)x+x = (—1)x4+1x = (-14]1)x = 0x = 0 
so that by (5), 
(-—1)x = —-x. 


Finally, to prove (iii), we suppose that « 4 0, but that 
ax =o. Then, by (9) and (10), we have 
ato = a 1(ax) = (a 10)x = Ix = x. 
But, given any scalar f and any vector y, 


By = B(y+o) = By+ Bo, 
from which it follows that Bo = o. In particular, «740 = 


and consequently x = o. © 


§47. Linear dependence and dimension. As before a set 
of vectors a, b,...,e,... of a vector space V(F) is said to 
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be a linearly dependent set if a set of scalars a, B,...,€,..- 
belonging to F, not all zero, can be found such that 


aat+Ppb+...+€e+...= 0. 


(In the case in which the set of vectors is infinite, we require 
all but a finite number of the scalars a, B,...,&,... to be 
zero.) If the set is not linearly dependent we say that 
a, b,...,e,... are independent vectors. 

In some vector spaces an infinite number of independent 
vectors can be found. For instance the infinite set of vectors 
1, x, x”, x°,... of P(F) is not a dependent set. In other 
vector spaces a positive integer m exists such that n indepen- 
dent vectors can be found although every set of n+ 1 vectors 
is a dependent set. In this case the vector space is said to be 
finite dimensional} and of dimension n. A vector space which 
consists of the zero vector alone is regarded as having 
dimension zero. According to this definition the space of all 
geometrical vectors is one of dimension 3. It is not difficult 
to show that the space of all 2 x 2 matrices is one of dimen- 
sion 4. 

An expression of the form «a+ fb+ ... +¢e is called 
a linear combination of the vectors a, b,...,e. Ifa, b,...,e 
are independent, then it follows that the only vanishing 
linear combination of them is the trivial one in which 
a=fP=...=¢e=0, since otherwise they would form a 
dependent set. 

Suppose that in a vector space of dimension n the vectors 
€1,...,e, are independent. Since any n+1 vectors form a 
dependent set, then for any non-zero vector a there must 
exist scalars a, B,,...,8,, not all zero, such that 


aat+B,e,+ eos + Be, = 0. 


+ A separate volume in this series Vector Spaces of Finite Dimension, 
by G. C. Shephard, will be published in 1966. 
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It is clear that « 4 0, for otherwise e,,..., e, would not be 
independent. It follows that any vector a can be written in 
the form 


a= Oye, + Joo Snore 


where a, = —f,/« if a#o and «,;=0 if a=o. Since 
€1,---,e, are independent, a,,..., a, are uniquely deter- 
mined by a. Thus every vector of V is expressible as a linear 
combination of the vectors e,,...,¢e,. Alternatively, we 
express this by saying that V is spanned by e;,..., @,. 
More generally a set of vectors spans a space V if every 
vector of V is a linear combination of the set and every such 
linear combination belongs to V. An independent set such 
aS €1,...,,, Which spans V is called a basis of V. 


THEOREM 47.1. Every basis of a vector space V of dimen- 
sion n contains n vectors. 


Proof. A basis cannot contain more than n vectors since 
any n+1 vectors form a dependent set. We have seen above 
that V has at least one basis of n vectors, namely the set 
C15 +++ 5 &n, SO if We assume that the vectors f,,...,/, form 
a basis of V, we know that s<n. Since every vector of V 
and, in particular, the vector e, is a linear combination of 


Tess Js We Have 
ep =Afit... tafe 


for certain scalars 4,,...,A, not all zero. Suppose that 

A, # 0. If this were not so we could renumber the /; so 

that this condition is satisfied. Then 
1 Az 

f= 3, 14, ee 4," 


showing that f, is a linear combination of e,,f,...,f,. 
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It follows that any linear combination of f;, fo,...,f,isa 
linear combination of e,,/,,...,/, and consequently that 
the space is spanned by e,,/>,...,f, Thus for certain 
scalars jl;,..., H,, not all zero, 


C2 = Mer thofat ... tush 


Now pp,..-,H, cannot all be zero since otherwise é,, e, 
would form a dependent set. We may therefore suppose, 
after renumbering if necessary, that yp, #4 0. Then 


u Is 
f= 7 et ey fa 


showing that f, is a linear combination of the set 
C1, €2,f3,--+»f, and that the space is spanned by this set, 
since we already know it is spanned by e,, fo, /3,...5f4- 
Applying a similar argument repeatedly, at each stage re- 
placing an f by an e, eventually we find that the space is 
spanned by e,,...,e,. We see now that s must be equal 
to n for if s were less than n then e, would be a linear com- 
bination of e;,...,e, and e;,..., e,, e, would be a depen- 
dent set, which is contrary to our assumption that e,,..., e, 
form a basis. 


The concept of isomorphism mentioned earlier (see §§ 19 
and 26) in connection with groups and rings can easily be 
adapted so that it applies to vector spaces. 

Let V;(F) and V, (F) be two vector spaces over the same 
field F. These two spaces are said to be isomorphic if a one- 
one correspondence 


g: V,(F) > V2(F) 


exists which maps V, onto V, and which is such that for 
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all vectors a, b of V, and all scalars A of F 


g(a+b) = g(a)+g(6), 
g(Ja) = Ag(a). 


Such a one-one correspondence is known as an isomorphism 
of V;,(F) onto V,(F). 


Example 47.1. Let G be the space of geometrical vec- 
tors and let R* be the vector space occurring in Example 
46.5 when F is the real field R and m = 3. Let a,b,c bea 
basis for G (see §45). We define a mapping f: G > R° 
as follows: if v = Ja+pb+ve, we write 


AC) = [A, H, ¥). 


Then it is easily verified that f is an isomorphism of G 
onto R°. 


By generalising the ideas contained in this example, we 
shall prove the following theorem. 


THEOREM 47.2. Every vector space V(F) of dimension n 
over a given field F is isomorphic with the vector space F" (as 
defined in Example 46.5). 


Proof. Let V(F) be a vector space of dimension n and 
lete,,..., e, bea particular basis. Since any vector a can be 
expressed uniquely in the form a = a,e,;+... +4,e, we 
may set up a one-one correspondence between the vectors a 
of V(F) and the ordered n-tuples [«,,..., ,] of the vector 
space F”. For a reason which will presently appear, we find 
it convenient to use two different notations to describe the 
n-tuple [a,,...,@,]. This n-tuple is a vector of F” and as 
such we denote it by the abbreviation v,. We shall also 
designate it by a, to indicate that it is the n-tuple associated 
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with the vector a of V(F) when e,,..., e, is chosen as the 
basis. The significance of the statement 


ae = vy 
is merely that the relation 
(3 Chia oo 6 Brehs. 


holds. The one-one correspondence which we mentioned 
between V(F) and F” is then of the form a « a, and the rela- 
tions a,+b, = (a+b),, A(a.) = (Aa),, which follow from 
the definitions in F” of addition and of multiplication by a 
scalar, show that this correspondence is an isomorphism. © 


We have as an immediate corollary the next result. 


THEOREM 47.3. Let V,(F) and V,(F) be two vector spaces 
over the same field F. If V,(F) and V,(F) have the same 
finite dimension n, then they are isomorphic. 


Proof. If they are of dimension n, both are isomorphic 
with F” and so are isomorphic with one another. © 


It must not be thought that there is a unique isomorph- 
ism between a given V(F) of dimension n and F". Each 
basis of V(F) will yield an isomorphism. With the twofold 
notation which we introduced we could denote one iso- 
morphism by 


aa, = 0, 
and another, using f,,...,f, aS a basis, by 
a a; = Uz. 


Just as different n-tuples v, and v, correspond to the same 
vector a under different isomorphisms, so does the same 
n-tuple v, correspond to different vectors a and 5 under 


§ 47 VECTOR SPACES 163 


different isomorphisms. Thus if a = a,e,;+... +a,e, and 
b=afit+...+4,f,, then v, =a, = b, and one iso- 
morphism would give 


a a, — 6., 
while the other would yield 
b SV b; = 0,. 


We proceed now to examine briefly the relationship 
between two isomorphisms 


a>a,=0, 
a> ay = Ug, 
relating a given V(F) with F”. 
Suppose then that e,,...,e, and fj,...,f, are two 
bases of V(F). Since every e; is a linear combination of the 
vectors f;,...5j/,, We may write 


If a is any vector of V(F), then, for some a,,...,«, and 
Bs, ++>5 Pa We have 


a= 2 a,c; = d B, f;- 
t 
Then substituting for each e;, we obtain 


NLU > 0 Aaj fis 


5, 
so that 


Y 6 - x a, Ais) fj = 9. 


j 
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Since f;,...,f, are independent each scalar coefficient 
in this last equation must vanish and so, for each j 


Pe Ds oy Ajj. 


These scalar equations may be combined into the single 
matrix equation 


Bio sss Bal [ts re on | Asi = satan 


In introducing the square bracket notation for an ordered 
n-tuple we anticipated the fact that an occasion would arise 
in which it would be convenient to regard the n-tuple as a 
row vector, that is to say, as a matrix with one row andn 
columns. Denoting the square matrix on the right by M, 
the above matrix equation may be succinctly written 

Vs = 0,M,j. 


Interchanging the réles of the two bases we can also 
obtain a corresponding matrix equation 


vz = 0,M,, 
from which it follows by substitution that 
v, = 1,M,M, 
and that 
v,(I—M,M,) = 0, 


where J is the nxn unit matrix. The vector v, however 
is at our choice since we would have obtained an equa- 
tion of this form from any vector a. Choosing a equal 
to e;,€2,...,¢@, in turn, that is to say v, equal to 
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eee Ol Odes Ol, -. «9 [0, 0,..., 1] in tum, we 
find that every element of the matrix J—M,M, is zero. In 
other words 


M,M, = I. 
and similarly, eliminating v, instead of vg, 
M,M, = I. 


Thus, M, and M, are reciprocal matrices and so each is 
invertible. Hence the row vector a, may be obtained by 
post-multiplying the row vector a, by an invertible matrix, 
the latter being the same for every vector a of F. 

Just as in the case of groups and rings, mappings of a 
vector space V,(F) into another vector space V,(F) over 
the same field which are not necessarily one-one mappings 
onto V,(F) but which satisfy the other requirements for an 
isomorphism, are known as homomorphisms. Thus we say 
that a mapping h: V,(F) > V,(F) is a homomorphism if 


h(a+b) = h(a)+h(b), (47.1) 
h(da) = Ah(a) (47.2) 


for all a,beV,(F) and all Ae F. A homomorphism of 
V,(F) into V,(F) is also known as a linear mapping from 
V,(F) to V2(F). 


Exercise 47.1. Show that h: V,(F) > V,(F) is a homo- 
morphism if and only if 


h(Aa+ pb) = Ah(a)+ phd), 
for all a, be V,(F) and all A, we F. 


Suppose now that V,(F) and V,(F) have finite dimen- 
sions n and m respectively. Let e,,...,¢, be a basis for 
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V,(F) and let f;,... fm be a basis for V,(F). Suppose that 
h: V,(F) > V,(F) is a homomorphism. Then h(e,) is a 
vector in V,(F) and so can be expressed in the form 


yy hinfp» 

p=1 
where h,,e F (i= 1,...,; p= 1,...,m). The hj, are 
the components of h(e;) with respect to f;,... 5S. and so 


are uniquely determined by the two bases and the mapping 
h. Since h satisfies (47.1) and (47.2), the vector h(x) corres- 
ponding to the vector x € V,(F) is 


pS, xi ty 
f= pail 


which shows that h is completely determined by the ele- 
ments h;, of F and the two bases. Thus A can be regarded 
as being completely determined by the nxm matrix H, 
whose (i, p)th element is ;,, and the two bases. 


§48. Subspaces and direct sums. A subset W of a vector 
space V is called a subspace of V if Wis itself a vector space 
with respect to the operations of addition and scalar multi- 
plication in V. To prove that a subset W is a subspace of 
V, it is sufficient to verify that x+y e W for all x, ye W 
and ax € W for alla e Fand allx e W. For then W contains 
the zero vector 0, since 0x = o by Theorem 46.1. Again 
by Theorem 46.1, W contains — x if it contains x. Thus W 
is a vector space, because conditions (2), (3), (7), (8), (9) 
and (10) for a vector space are satisfied in any subset of V. 


THEOREM 48.1. A subspace of a finite-dimensional vector 
space has finite dimension. 


Proof. Let W be a subspace of V. If W consists only of 
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the zero vector, then it has dimension 0. Otherwise W con- 
tains at least one non-zero vector x,. The subspace spanned 
by x, is contained in W. If it is the whole of W, then W has 
dimension 1. If not, W contains a vector x, such that x, 
and x, are linearly independent. The subspace spanned by 
x, and x, is contained in W. If it is the whole of W, then 
W has dimension 2. If not, W contains a vector x; such 
that x,, xX, Xx; are linearly independent. Continuing in this 
way, we find at each stage that either W is finite dimen- 
sional or that we can increase the number of linearly inde- 
pendent vectors in W by one and proceed to the next stage. 
But W is contained in V and V has at most n linearly inde- 
pendent vectors. Hence the process terminates after a finite 
number of stages and so W has finite dimension. © 


THEOREM 48.2. If V is an n-dimensional vector space and 
W is a subspace of V such that W # V, then the dimension 
of W is less than the dimension of V. 


Proof. By Theorem 48.1, W is finite-dimensional. Let 
W have dimension k. Then W contains k linearly indepen- 
dent vectors. Since W is contained in V and V has dimen- 
sion n, we have k<n. If k =n, then W is spanned by n 
linearly independent vectors. But any n linearly independent 
vectors in V form a basis for V. Hence if k =n then 
W= V. © 


Let S, and S, be two subsets of a vector space V. By 
the intersection S, OS, of S, and S, we mean the set of all 
vectors common to S, and S,. By the linear sum S, +S, of 
S, and S, we mean the set of all vectors of the form x, +2, 
where x, € S, and x, € S}. 


THEOREM 48.3. If S, and S, are subspaces of a vector 
space V, then S,S, and S,;+Sy, are also subspaces of V. 
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The subspace S, AS, is the largest subspace contained in both 
S, and S, in the sense that if W is a subspace contained in 
S, and S,, then W is contained in S; Sz. The subspace 
S, +S, is the smallest subspace containing both S, and S, in 
the sense that if W’ is a subspace containing S, and S», then 
W’ contains S,+S>. 


Proof. Suppose that x, ye S,; AS. Then x, ye S, and 
so x+yeES,, since S, is a subspace of V. Similarly 
x+yeS,; hencex+yeS,S,. Also axe S;S,. There- 
fore S,; MS, is a subspace of V. 

Suppose now that x, ye S,+S,. Then x can be ex- 
pressed in the form x,+x, and y can be expressed in the 
form y,+y2, where x,,y,¢S, and x,,y,¢S,. Hence 
x+y = (x, +¥,;)+(%.+y2), so that x+y is expressible in 
the form z,+z, where z,¢€S, and z,¢€S,. Also 
ax = a(x, +X2) = ax, +ax, and so axe S,+5S,. There- 
fore S,+S, is a subspace of V. 

Let W be any subspace of V contained in S, and S$). 
Then clearly W-S, OS,. On the other hand, if W’ contains 
S, and S,, then W’ contains every vector x, € S, and every 
vector x, € S, so that, since W’ is a subspace, W’ contains 
every vector of the form x, +x, where x, € S, and x, € S$). 
Hence W’ contains S, +S). © 


Example 48.1. Let a,b,c be any three vectors in G, 
the vector space of geometrical vectors. If $, is the sub- 
space spanned by a and 6 and S, is the subspace spanned 
by a and c, then S, OS, is the subspace spanned by a pro- 
vided that a, b,c are not coplanar; S,+S, is the subspace 
spanned by a, b,c. Thus if a, b, c are linearly independent, 
S,+S, = G. 


If S, and S, are two subspaces of V such that S$, AS, 
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consists of the zero vector and S,+S, = V, then S, is said 
to be complementary to S, and is called a complement of S,. 
Clearly if S, isa complement of S,, then S; is a complement 
of S,. In general, a given subspace S, has more than one 
complement. 


Example 48.2. Let e,,e,,e; be a basis for G. Let 
S',,S, bethesubspaces spanned bye, and bye, e; respectively. 
Then S, is a complement of S,. Let S, be the subspace 
spanned by e, +e, e,+e3. Then Sj is also a complement 
of S;. 


Exercise 48.1. If S, and S, are two subspaces of V such 
that S, 5S, consists of the zero vector, prove that any 
vector of S,+S, can be expressed in only one way in the 
form s, +52, where s, € S, and s, € S}. 


Let S and T be any two vector spaces over the same 
field F. Let U be the set of all ordered pairs (s, t), where 
se S and teT. We define addition and scalar multipli- 
cation in U by 


(51, t1) + (52, to) = (8, +582, ty +02) 
a(s, t) = (as, at), 


where $,, 52,5€S,t,,t,,teT andaeF. With respect to 
these operations, U is a vector space. The zero vector of U 
is the pair (0, 0); here we use the same symbol to denote 
two different vectors, for the first o denotes the zero vector 
of S and the second denotes the zero vector of T. 

The vector space U constructed in this way from S and 
T is called the direct sum of S and 7 and is denoted by 
S @ T. 

The subset of all vectors in U of the form (s, 0), where 
seéS, is a subspace S* of U. Clearly S* is isomorphic with 


EAA M 
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S under the “ natural’ isomorphism f: S* — S defined by 
f{(s, 0)} = s. Because of this natural isomorphism, we 
sometimes “ identify’ S* with S; in other words, we re- 
gard them as being the same vector space. Similarly, the 
subset of all vectors in U of the form (0, ft), where t é€ T, is 
a subspace 7* of U and T* is isomorphic with T. Given any 
vector (s, t) € U, we can write 


(s, t) = (s, 0) +(0, f) 


and thus any vector in U can be expressed in the form 
s*+1*, where s*e S* and t*e7*. Moreover, it is easily 
seen that S* A7T* consists only of the zero vector of U and 
so S* and T* are complementary subspaces of U. 

It is important not to confuse the sum $+T of two sub- 
spaces S, 7 of a vector space V with the direct sum S @ T. 
However, if S and T are subspaces of V such that SaT 
consists of the zero vector only, then there is a close con- 
nection between S+7Z and S @ 7, which enables us to 
“identify ’’ these two vector spaces. For, as we have 
already seen, S © T has two complementary subspaces S* 
and T* isomorphic in a natural way with S and T respec- 
tively, such that 


S@ T= S*+T*. 


Thus, in particular, if S and T are complementary subspaces 
of a vector space V, then V can effectively be regarded as the 
direct sum of S and T. We sometimes express this by saying 
that V is the internal direct sum of S and T and then refer 
to the direct sum defined above as the external direct sum. 


Exercise 48.2. Let S, T be vector spaces having dimen- 
sions m and n respectively. Let 5,,...,5,, be a basis for S 
and let t;,...,¢, be a basis for 7. Prove that S @ T is 
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spanned by the vectors (s,, 0), ..., (Sms 0); (0; t1),.++5 (0, t,). 
Deduce that S @ T has dimension m+n. 


The idea of the direct sum of two vector spaces can 
easily be generalised. Let S,, S,,..., 5S, be vector spaces 
over the same field F. Let U be the set of all ordered k-tuples 


WS 85s cece Se) 


where s;eS; (i = 1,2,...,k) and define addition and 
scalar multiplication in U by 


etSa5 << + s Sp) +(5y5S5,«--, 5p) = 
== (9; +54, S245), - «+55 +5,) 
lS sy. say Sp, as. ay ESE), 


Then U, together with these operations, is a vector space 
which we call the direct sum of S,, S,,..., S, and write 


U=S,08,0...08&. 


Exercise 48.3. Prove that if S,, S,, $3; are three vector 
spaces over the same field, then (S, ® S,)@ 583, 
S, ® (S, © S3) and S, ® S, ® S; are isomorphic. 


Exercise 48.4. Prove that if S,, S,,..., 5, are finite 
dimensional vector spaces having dimensions n,, 12,..., Mg 
respectively, then S,; ®S,@...@S, has dimension 
ny t+ng+ 508 +n, 

Exercise 48.5. Prove that every vector space of dimen- 
sion 7 is isomorphic with a direct sum of n one-dimensional 
vector spaces. 


§49. Unitary spaces. Geometrical vectors by their very 
definition are intimately associated with the concepts of 
magnitude or length and direction but we have not so far 
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introduced these ideas into more general vector spaces. We 
shall consider such problems in the present section. 

In the space of geometrical vectors we denote the mag- 
nitude or length of a vector a by || a ||. Thus || a |] is always 
a non-negative real number. A scalar product a-b is de- 
fined for any two vectors a, b by the relation 


a-b= [al] |b] cose, 


where 6 is the angle between the directions of a and b. This 
scalar product of two vectors is quite different of course 
from the product Ja of a scalar with a vector. Further the 
scalar product is not a vector but, as the name suggests, is 
ascalar. There is no question here of whether the associative 
law is satisfied or not, for an expression such as (a- b):c 
has no meaning. The scalar product of the scalar (a: b) 
with the vector c is not defined. It is not difficult to show 
that the following rules apply 


a-b=b-a, (49.1) 
a:(b+c) = (a: b)+(a:c), (49.2) 
(Aa): b = Aa: Bb). (49.3) 


The commutative law (49.1) follows from the definition and 
the distributive law (49.2) can be proved with the help of 
some elementary trigonometry. In (49.3) it is assumed that 
AeéR and the identity follows from the fact that if A is 
positive then || 4a || = 4 || a |]. 

We mention two immediate consequences of the defini- 
tion of a scalar product. First if a and b are perpendicular 
then a- b = 0. On the other hand if a: b = 0 then either 
the vectors are perpendicular or one of the vectors is the 
zero vector. Secondly a: a = || a ||?, which shows that the 
magnitude of a may be written +4/ (a+ a). We observe also 
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that the angle between a and 5 is given by 

a:b 
/(a: a) f(b bY 


from which we see that not only magnitude but also direc- 
tion can be specified by the use of scalar products. 


cos 9 = 


For the space of geometrical vectors it is frequently 
desirable to choose an orthonormal basis: that is, a basis of 
which each vector is of unit length or magnitude and such 
that any two basic vectors are orthogonal. Suppose that 
€1, 2, €3 is such an orthonormal basis. If we wish, they 
may be thought of as vectors of unit length lying along the 
axes of a rectangular coordinate system OX YZ. Since this 
basis is an orthonormal one we have immediately the 
following relations 


Ce ej = 6:3, 


in which 6,,; is the Kronecker delta, which takes the value 1 
if i = j but the value 0 otherwise. Suppose now that 


a=Yae, b= Y Biey 
Then, using (49.2) and (49.3) we find that 
a-b= ( a ;e;) ° Q, Bje;) 
an X 8; (e;* e;) 
PE atA 
d aBi 


i 


018, +0282 +0383. 
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Further, 
[a]? =a-a = af +a3+e3 = 0. 


Thus, || a || = 0 if and only if #, = «, = a; = 0, that is, 
| a|| = Oifand only ifa =o. 


It is to be noticed that this last statement, which merely 
states that the zero vector is the only vector of zero length, 
depends strongly on the fact that the space of geometrical 
vectors is over the real field. Were complex scalars permitted 
then we could not argue that «?+43+«3 = 0 implied that 
C= a, =e 0. 

It is easy to generalise the idea of scalar product to 
arbitrary vector spaces over the real field. This can be done 
by associating with each ordered pair of vectors a, b a real 
number a: b, which is subjected to the conditions expressed 
in (49.1), (49.2) and (49.3). For some purposes, this defi- 
nition is too general, and the following additional restric- 
tions are imposed: 


a-a20 (49.4) 
a:a= Oif and only if a = o. (49.5) 


As we have seen, these conditions are satisfied in the space 
of geometrical vectors. 

In some of the applications of the theory of vector spaces 
it has been found useful to extend the idea of a scalar pro- 
duct to vector spaces over the complex field. In this case, 
the commutative law (49.1) is replaced by 


a‘b=b-a, 


where b- a denotes the complex conjugate of b- a. In the 
case of the real field, this identity is equivalent to (49.1). 
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A unitary space is a vector space over the complex field 
or some subfield of the complex field in which, for every 
ordered pair of vectors (a, b) a complex number, denoted by 
a+b and called the scalar product or inner product of a and 
b, is defined and satisfies 


a‘b=b-a, (49.1*) 
a: (b+c) = (a: b)+(a‘c), (49.2) 
(Aa): b = Aa: b), (49.3) 
a‘a20, (49.4) 
a-a— Oi and only if a = o, (49.5) 


for every scalar A and all vectors a, b, c. 

From (49.1*) it follows that a+ ais a real number, which 
assures us that postulate (49.4) makes sense. An immediate 
consequence of (49.1*) and (49.3) is that 


a> (Ab) = (Ab): a = A(b: a) = 16: a) = 1@°5). 
We may therefore extend (49.3) to read 
(Aa): b = Aa: b) = a: (Ab). (49.3*) 


Taking a hint from the space of geometrical vectors, we 
define the length || a || of a vector by the equation 


a] = +./(@- a). 
It is clear from (49.4) and (49.5) that 
2] 20 
and that 
|| a || = Oif and only if a = o. 
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These are certainly desirable properties for a length to 
possess; moreover we shall show presently that this defini- 
tion of length also leads to the triangle inequality 


Jel+]o] 2 e+e]. 
THEOREM 49.1. Jn a unitary space 
lel ele|e-4]. 
Proof. From (49.4) it follows that 
(Aa+ pb) + (Aat+ pb) = 0 


for all scalars 4 and yp. Using (49.2) and (49.3*) we can 
write this 


AX(a + a)+Aji(a: b)+ pAb: a)+ ppb: b) = 0. 


If we now choose 2=b-b and p= —a-b, then 
A= b-b, fi = —b-a, and so 


(b+ b)’(a- a)—(b- bY + aa: b)—(a- bb + bb + a) + 
+(a:b)(b- ab: b) = 0, 
or, 
(6 - b)[(b- ba: a)—(a: (b+ a)] = 0. 
If b 4 o, then 6: b > O and we have, in view of (49.1*) 
[el fei ela-ap 


where |a-b| denotes the absolute value of a:b. This 
yields the required result. If however, b = o then (a: b) = 0 
from (49.2) and the same inequality is satisfied. © 


The inequality occurring in this theorem is known as 
Schwartz’s inequality. 


Observing that a - b+ - ais twice the real part of (a - 5), 
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we see that a-b+b-aS2|a-b|. It follows from 
Schwartz’s inequality that 


(lat+ 6p? = Ja|?+2 fe] | aa 
2 (a:a)t+2/a-b\+(: 
2 (a: a)+(a:b)+(6: see b) 
aU +b): (a+b) 
= |a+o |). 


Consequently we have the triangle inequality 
Jol+] 4] 2 a+]. 


Schwartz’s inequality may be expressed in the form 


Jeb] ., 
la] jal 


In the case of a Euclidean space, that is to say, a unitary 
space over the real field, this inequality assures us that 


a:b 
-1< — fl 
Jal Te] 
and we may define the angle 6 between the vectors a and b 
by the equation 


a:b 
c= Tal del 


In the case of a unitary space, however, the expression on 
the right is a complex number and in this case it is not usual 
to define an angle between two vectors. Nevertheless even 
in the complex case we adhere to the definition that a and b 
are orthogonal vectors if and only if 


a h= 0 
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and a set of vectors e,, €2,... forms an orthonormal set if 
and only if for all values of i and j, 


e;° ej => 5 jj. 


THEOREM 49.2. Every finite dimensional unitary space has 
an orthonormal basis. 


Proof. If the space be of dimension n it has a basis of 
exactly m non-zero vectors u,,...,4u,. It is patent that 


aa 
a 


is a vector of unit length. We follow a step-by-step pro- 
cedure and suppose that at an intermediate stage we have 
successfully constructed e,,..., e, such that 


e;°e; = Oi, G@ feed, ... 552) 


and such that e, is a linear combination of u,,...,u,. We 
seek now a vector v such that 


v-e,=0, G=1,...,n) 


and such that v is a linear combination of e,,..., ,, p44 
and is therefore a linear combination of u,,...,u,4, 
Suppose that 


Vv = Uyy4—(Hyey + ... +4, €,). 
Then, for i<r, 


|! 
aS Ia sl OS 2) 
r 
= 44° ej— >: 0 Oj; 
Jj=1 
= Une * Cp Oj. 
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Therefore, if we choose 


R 
| 
= 
- 
= 
® 


we get 
Vie, G= ln .?). 
Consequently a vector v with the required properties has 


been constructed. If we now write 


Cr+1 = Tol v, 


then e,,, is of unit length and we have constructed 
C1, >> 5-47 SucH that 

€;e; = 07, GJ — loa..,r+1). 
By this method we construct e;, e,,..., e, in turn so that 

€;-@; = 0), Gf = 1,026.50): 
Suppose now that 

He; +... +0,€, = O. 
Then 
0=0-e, aes (e;° AN = UTS te 

Since our hypothesis entails that each a; vanishes, we con- 


clude that e,,...,e, must be rleven an} vectors. Since 
there are n of them, they form a basis for the space. © 


Consider now an n-dimensional unitary space and choose 
an orthonormal basis e;,..., e,. Any vector a of this space 
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may be expressed in the form )' a,e;. Then 


a-b =(¥ ae,)- (d) Bje;) 
= a; (e;° e;) 
= » 089i; 


i,j 


py 0B. 


This is the formula which reduces to 


a:b = 048, +%28,+43B, 


in the case of the space of geometrical vectors. 
If, as before, we set up the correspondence 


@ > Ay [1015.. veel: 


regarding a, as a row vector, it will be seen that the inner 
product a- b is just the matrix product 


abe 


where 5? is the transposed conjugate of the row vector b,: 
that is, where 57 is the column vector 


By 
B, 


B, 
The foregoing discussion has shown us that the most 


general finite dimensional unitary space over C is only a 
slight generalisation of the familiar space of geometrical 


§ 49 VECTOR SPACES 181 


vectors. In the first place we have generalised from 3 ton 
dimensions and secondly we have generalised from the real 
field to the complex field. That nothing more is involved is 
a little surprising when we recall the abstract nature of our 
assumptions. The vectors of our vector space were not 
defined with any geometrical properties; only algebraic re- 
lations between them were specified. Likewise the definition 
of the inner products of our unitary space did not involve 
the concepts of length or angle; only algebraic properties 
were stipulated. 

The theory of infinite dimensional unitary spaces lies 
beyond the scope of this book. Such spaces are nevertheless 
of great practical importance since they find application 
in Quantum Theory. We content ourselves with one simple 
illustration of such a space. 


Example 49.1. Consider the vector space of all real 
functions of x which vanish at x = 0 and at x = a and 
which are continuous and of bounded variation in the range 
O<x<n. It is readily verified that these functions satisfy 
the axioms of a vector space over the real field R. The 
vectors of this space are those functions which have the 
above properties. To convert this into a unitary space (in 
this case a Euclidean space since the field is R, a subfield of 
C) we define the inner product of two functions f and g 
to be 


jon | “Flo e(%) dx. 


It can be verified that this satisfies our definition of an inner 
product and so these functions are the vectors of a unitary 
space. We can, however, find as many of these functions as 
we please which are linearly independent, for example the 
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functions: f, (m= 1, 2, 3, .. .) suchthat 


Si(x) = x(x—7), fo(x) sar x*(x—7), 0.0.0: 
BP G2 eae or 8 eae 


For if the first n of them were dependent but the first n—1 
were independent, then we would have an equation, valid 
for all x such that O<x<z, of the form 


A,x(x—m)+A,x7(x—2) +... $A,x"(x—2) = 0 


in which 4, 4 0. This is an equation of degree n+1 and 
has at most n+ 1 distinct roots. It cannot therefore be satis- 
fied by all values of x which lie between 0 and z. Since our 
assumption leads to a contradiction we conclude that this 
unitary space is not of finite dimension. 

Furthermore, we can construct an infinite orthonormal 
system of functions (or vectors) e,, e2,... such that 


2\t 
é,(x) = (=) sin rx, 
Tt 


with the property that 


2 | ” (x) e,(2x) dx 


0 
D n 
== sin rx sin sx dx 
m Jo 
— oO 


Furthermore the theory of Fourier series tells us that 
any function a of this space can be expressed in terms of an 
infinite series by 


a(x) = B, sin rx, 
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a a 2\t 
where the coefficients 8, are real. Writing 8, = a,{—} we 
T 


could express the above condition in the more suggestive 
form 


oo 
a=) a¢,. 
1 


To determine the coefficient «, we have only to form the 
inner products of both sides of this equation with e,. Thus, 


Hence, 


& 
i 
a 
S 
] 


if a(x) e,(x) dx 


0 


= (=) Ih a(x) sin sx dx 
3 tt) 


and consequently we obtain the formula 


= = | a(x) sin sx dx, 


0 


which is familiar to readers who have studied Fourier 
analysis. 


§50. Normed vector spaces. Although the idea of a 
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vector space with an inner product is a fruitful one, it has 
been discovered that many of the interesting results obtained 
for such spaces are satisfied in a wider class of spaces, in 
which the idea of length or norm of a vector plays an im- 
portant part, but in which an inner product is not neces- 
sarily defined. Such spaces, which are known as normed 
vector spaces, play an important part in modern mathe- 
matical analysis. 

Let V be a vector space over the field F, where either 
F=R or F=C. With each vector x € V we associate a 
real number | S3 | having the following properties: 


|| x] = 0 forallxeV, (50.1) 

|| x || = 0 if and only if x = 0, (50.2) 

| ax || =| o| || x | for all xe V, and allawe F, (50.3) 
| x+y] Ss] x]+]>1- (50.4) 


Thus we have a mapping x — || x || of V into the set of real 
numbers. When such a mapping satisfies (50.1) to (50.4), 
we call || x || the norm of x. A vector space in which a norm 
is defined is called a normed vector space. 

From the results of §49, a norm can be defined in a 
vector space having an inner product by writing 


| +] = V@-»). 


It can be shown, however, that there exist norms which 
cannot be expressed in this way in terms of an inner product. 
In the case where an inner product exists, it is clear that the 
length of a vector satisfies the requirements for a norm. 


Example 50.1. Let M, ,(C) be the vector space of 2 x2 
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matrices over the complex field. Then the mapping 


b 
| oo [> mxdab leh lehled 
defines a norm in M,,(C). The mapping 


[2 5] + WdePelorelerslar 
defines another norm in M, ,(C). 


§51. Linear algebras. It is recalled that in any vector 
space the product of a vector with a scalar is defined. 
Furthermore, if the vector space be a unitary space a scalar 
product a - b of two vectors a and b is defined and this scalar 
product is, as the name implies, a scalar of the underlying 
field F. So far we have not mentioned the possibility of 
multiplying one vector with another to get a product which 
is itself a vector. Until such a multiplication has been intro- 
duced we cannot even ask the question whether some or all 
of the vectors of a given vector space form a multiplicative 
group or semi-group. 


A linear algebra L(F) over the field F is a finite dimen- 
sional vector space V(F) in which for every ordered pair 
a, b of vectors of V(F) a product vector ab belonging to 
V(F) is defined such that for all a, b, c belonging to V(F) 
and for every scalar A of F the following relations hold 


a(b+c) = ab+ac, (S11) 
(a+b)c = act+be, (51.2) 
(Aa)b = A(ab) = a(Ab). (51.3) 


The product vector ab, or as it is sometimes called, the 
EAAN 
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vector product ab, has of course quite a different meaning 
from that of the scalar product denoted by a’ b. The opera- 
tion of forming the vector product is in fact a binary 
operation in L(F). It is to be noticed that the commutative 
law ab = ba is not in general true. For this reason it is 
necessary to include both distributive laws (51.1), (51.2) 
as postulates. In certain cases, however, in particular when 
the commutative law holds, each of these can be deduced 
from the other. It should also be observed that in general 
we do not even demand the validity of the associative law 
of vector multiplication. 

Suppose that a, b are two vectors of a linear algebra and 
that in terms of some basis e,,..., e, we have 


a=) ae; b= ¥ B,e;. 


With the help of the three defining postulates (51.1), (51.2) 
and (51.3) it is easy to show that 


ab = )' dD. a,B jee). 
fg 


In view of this formula it is clear that any product vector 
ab is uniquely specified provided a rule is available for 
calculating all products of the form e,e;. Each product e,e, 
is, of course, a vector of the linear algebra and is conse- 
quently a linear combination of the basic vectors. That is 
to say, a set of n? relations of the form 


k 
ee; = » Viz ek 


must exist in which the three-indexed coefficients yi, are 
scalars of F. The suffix k is written above the suffixes i and 
j for convenience. It does not indicate a power. Once the 
n° coefficients yi; are known the product of any two vectors 
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a and b is uniquely determined by the formula 


ab =) 0; By Yi; en 


t, 3,6 
The coefficients yi; are called the constants of structure for 
the basis e,,..., e, of the linear algebra concerned. 


Before proceeding further we cite a few examples of 
linear algebras, with some of which the reader is no doubt 
already familiar. 


Example 51.1. The set C of all complex numbers forms 
a two-dimensional linear algebra over the real field R. If 
we write e, = 1, e, = i then e,, e, form a basis of C since 
every complex number can be expressed in the form 
%,e€,; +a2e, in which «,, «, are real numbers. Multiplication 
of two complex numbers is then determined by the following 
relations 


2 = = os 
Cy =-€y, €14€2 = €2€, = C2, €2 = —@4; 


from which the eight constants of structure are easily seen 
to be 


Vit = i; 12 = 0, 21 0, 22 al 


wi=0, yiz=1, yun=1, yo2=9. 


An alternative basis such as f; = 1+i, f, = 1—i might, 
of course, have been chosen, in which case the constants of 
structure 6f, would be quite different. For instance 
f? =2i=f,-f:, from which 07, = —1. As we know, 
multiplication of complex numbers is both commutative 
and associative. 


Example 51.2. The set of all m xm matrices over a field 
F forms a linear algebra of dimension m’. If e;; denotes the 
matrix with +1 in the ith row and jth column and with 0 
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in every other position, then a typical matrix a can be 
written 


a= oz Oe; jj; 
t, Jj 


That is to say, a is the matrix with the scalar «;; in row i and 
column j. Clearly the m* matrices e;; form a basis and 
matrix multiplication is defined by the relations 


Cea = O jx i 


In this case there are (m*)° constants of structure with the 


values 


VED an = Oj Sip Dig: 
In this linear algebra, multiplication is associative but is 
not commutative. 


Example 51.3. Though we have not mentioned it earlier 
a vector product, usually denoted by axb, of two geo- 
metrical vectors is defined. Though such a product may be 
defined geometrically it will be sufficient for our purpose to 
say that if e,, e,, e; is an orthonormal basis for the vector 
space, the vector product of a = a,e,+,e,+a3e, and 
b = Be, +fe.+f3e; is given by 


axb = (4,B3—a382)e, + (4381 — %1B3)e2 + (0182 — 2B, )e3. 


It may be verified that this definition satisfies the postulates 
(51.1), (51.2) and (51.3) and consequently forms the vector 


space into a linear algebra. As special cases of the above 
formula we get 


€; Xe, = €, X€2 = €3 X€3 = 0, 
€2 X€3 = —€3Xe, = ey, 
€3 Xe; = —e; X€3 = e2, 
€; Xe, = ~—e2 Xe, = C3. 


§ 51 VECTOR SPACES 189 


Consequently the constants of structure are all zero with 
the exception of those specified by the equations 


The linear algebra of geometrical vectors is neither com- 
mutative nor associative, though the anti-commutative law 
axb = —bxais satisfied for all vectors a, b. In particular 
axa =o for every a. 

We have already mentioned that in the case of a linear 
algebra in which multiplication is commutative, the two 
distributive laws (51.1) and (51.2) are equivalent to one 
another. This is again true when multiplication is anti- 
commutative; thus if ab = —ba, then (51.1) is satisfied if 
and only if (51.2) is satisfied. In particular, this holds in the 
linear algebra of geometrical vectors. 

It should be mentioned here that the multiplication of 
two geometrical vectors to yield their vector product cannot 
be extended in an entirely satisfactory way to unitary spaces 
of higher dimensions. For a satisfactory extension to four- 
dimensional space, for instance, we require a vector product 
of three vectors and this provides an instance of a ternary 
operation (see §3). 


Example 51.4. A case of some importance both in theo- 
retical physics and abstract algebra is the group algebra of 
a finite group G. In this linear algebra the group elements 
€1,€2,... form a basis and the multiplication of the basic 
elements is just the multiplication in the group. This means 
that i, is zero unless ee; = e,, in which case yj, takes the 
value 1. This linear algebra contains not only the group 
elements themselves but also linear combinations of them as 
elements of the algebra. Such linear combinations are hard 
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to visualise unless one thinks of the group elements as per- 
mutations which act as operators. In the case of the group 
of order two with only the identity permutation e, and the 


permutation e, = as group elements, a typical ele- 


Le 

ies 
ment of the algebra would be o,e,+a,e,. Operating on 
the function f/(x,, x.) with this element would yield the new 
function a, f(x, X2)+%2f(x2, x). Problems concerning 
symmetry can be attacked by means of a group algebra. For 
instance, the function f(x,;, x2) is symmetric in the two vari- 
ables x,, x, if and only if it is annihilated by the element 
e€,;—e, of the group algebra. A group algebra is always 
associative but is not commutative unless the group is 
abelian. 


Provided that the vector product operation is associative, 
the linear algebra forms a ring, since we already know that 
its additive group is abelian and that the distributive 
laws hold. Thus Example 51.2 gives rise to a ring of 
matrices over a field F, while Example 51.4 yields what is 
known as the group ring of G. Again the complex field of 
Example 51.1 is a ring. The set of all geometrical vectors is 
not a ring with respect to the vector multiplication defined 
in Example 51.3, since this is non-associative. 

It will be observed also, as in Example 51.3, that a 
multiplicative identity need not exist, and even if it does, as 
in Examples 51.2 and 51.4, an element may have no inverse. 

It will readily be appreciated that the validity or other- 
wise of the commutative and associative laws is determined 
by the values of the constants of structure. Thus, if a linear 
algebra is commutative, then ab = ba for all vectors a, b, 
and in particular e,e; = eje;. Thus 


Vy = The (51.4) 
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Conversely, if this relation holds for all i, j, k, then e,e jp = ee; 
and 


eer > bes= ) d aBiee, — ) Y Brre,e 
i j od) 
= » Be; y “ei = ba. 
j i 


Similarly an algebra is associative if and only if e,(e;e,) = 
(e,e;)e,. Now 


e,(e ;e,) a a( yy Vin «) = y yh Vin ep 
and 
(e;e ex = ( » Vi ese = » 2 vi, Ve Cp. 


It follows from the independence of the basic elements 
€1,-.., , that the algebra is associative if and only if 


ys up py vhs Vik (51.5) 


for all h, i, j, k. 

The n? scalars yf,, in which i is a fixed integer while j 
and k range from 1 to n, may be arranged in a square 
matrix, k denoting the row and j the column. Call this 


matrix T;. Then 
r= E vee = 
' Vit CP Yin 


Consider the product of the matrices [; and T;. The ele- 
ment in row A and column k of the product is, according to 
the rules of matrix multiplication, 


DY vi Vine 
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If the linear algebra is associative, then by (51.5) this matrix 
element may also be written 


py Vi hes 


which is the element in row A and column k of the matrix 
>, vi;0}. Accordingly, 
if 


rr; = ye vy Th. 
It will be observed that the multiplication table for the 


matrices T,,...,1I°, is identical with that for the basic 
vectors e,,..., €, Which takes the form 


i 
e;e; = » Viz er 
Corresponding to any element 
() — iin asap ares4e- 


of the given linear algebra, we now construct the matrix T, 
defined by 


1 ae = 0+ zicenye +0,0 4s 


which has ¥° «yj, in row j and column k. Observe that our 
i 


notation has been slightly modified in that [,,..., I, and 
r,,,.-.-,1T., now denote the same 7 matrices. 
Since, as has already been shown, 


ab = a oB iyi. 


ase 
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it follows that 


Dy = > a8 ul 


i, j,k 
= a ab TT; 
J 
i j 
= ry. 
Furthermore, it follows at once from our definition of I, that 
Po+p = +1, 
and that 
| Bye = Ale 
A matrix representation of a linear algebra L(F) is a 


mapping a > M, of L(F) onto a linear algebra of square 
matrices of some fixed order over the same field F, such that 


Ma ma M,M,, NEAR: oa Vive, AM, = MiGs 


a 


for all a, be L(F) and all Ae F. Such a representation is a 
faithful representation if the mapping is one-one. 

We have in fact just shown that any associative linear 
algebra has at least one matrix representation, namely 
a-—>T,. We shall next show that this is a faithful repre- 
sentation provided the algebra possesses a multiplicative 
identity 1 with the property that al = a for all a. Suppose 
that 2. — ie Then 


(a, —f) ict soe s +(as— 2) 1 > O, 


where O is the zero matrix. Equating corresponding matrix 
elements, we find that for all values of j and k 


(0; —B1) Yiyt - ++ +@n—Bn) Yay = 0. 
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It follows that 


X {1 Bi) vist --- + Oa Ba) Taser = 9 
or 
(a,—B,) eye; +... +(0,—B,) ene; = O. 
Thus, for all j, 
(a—b)e; =0. 


If, however, L(F) has a unity 1 of the form )) ¢,e,, then 
j 


a—b = (a—b)l = ¥ e(a—b) e; = 0, 


J 


and so a = b. In this case the mapping is clearly one-one 
and therefore provides a faithful representation as was 
claimed. 

If an associative linear algebra of dimension n does not 
possess an identity 1, the algebra can be augmented by 
inserting an additional basic element 1 with the properties 


1? = I; le; = e;l = @). 


The original algebra is now embedded in the augmented 
algebra of dimension n+ 1 with basic elements 1, e),..., &y 
It can be proved that a faithful representation of the aug- 
mented algebra contains a faithful representation of the 
original algebra, and using this we can show that every 
associative linear algebra has at least one faithful matrix 
representation. 

The above ideas can be expressed in a different way, 
without making use of bases. Suppose that L(F) is a linear 
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associative algebra. If a is a fixed element of L(F), we shall 
denote by A, the mapping of L(F) into itself defined by 


NAO == AX. 


Then it is easily shown that A, is a linear mapping of L(F) 
to L(F). We consider the set of all such mappings, as a 
varies over L(F). We observe that these mappings satisfy 
the following properties: 


Ag+ = Mat Ap; 
oe = Avy; 
Ag = A,A;. 


Here A,+A, denotes the mapping defined by 
(A, +A,)Q) = A(x) + A,(), 
AA, denotes the mapping defined by 
(AA, )(x) = AA,(x) 


and A,A, is the composition product. 
In fact, the mappings A, form a linear associative algebra 
over the field F. We shall denote this algebra by L*(F). 
The structures of L(F) and L*(F) are closely related. 
Let h: L(F) > L*(F) be the mapping defined by 


h(a) = A,. 


Then we can easily see that 4 is a homomorphism of L(F) 
into L*(F): that is, 


h(a+b) = h(a)+h(b) 
h(ja) = Ah(a) 
h(ab) = h(a) h(6) 
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for alla, b€ L(F) and A¢ F. This homomorphism is known 
as the regular representation of L(F). In general, a repre- 
sentation of L(F) is a homomorphism of L(F) into a linear 
algebra whose elements are linear mappings of some vector 
space over F. If the homomorphism is a one-one mapping, 
then we call the representation faithful. We now show that 
if L(F) contains a multiplicative identity 1, then the regular 
representation is faithful. For suppose that A, = A,. Then 
ax = bx for all xe L(F). In particular, if we put x = 1, 
we get a= b. Hence the mapping a — A, is a one-one 
mapping. Since the regular representation always maps 
L(F) onto L*(F), it follows that in this case the algebras 
L(F) and L*(F) are isomorphic: that is, there is a one-one 
homomorphic mapping of L(F) onto L*(F). 

It should be observed that in this approach we do not 
use the assumption that L(F) has finite dimension; thus we 
have a more general result than that proved earlier. 


Example 51.5. In quantum mechanics certain operators 
associated with electron spin occur. These operators, which 
we shall calle,, e5, e3, though in quantum mechanics they are 
usually denoted by o,,0,,¢,, have the following multipli- 
cative properties: 


Ce, = 1 (r = ly 2 3) 
ee, = ie, = —€é,, 


if r,s, fis an even permutation of 1,2,3. Writing e, for 1, 
an associative linear algebra of dimension 4 with basic ele- 
ments €9, €;, €,€3; can be constructed. The appropriate 
constants of structure are determined by the foregoing rules 
of multiplication. We leave it as an exercise for the 
reader to show that the appropriate representation is 
given by 
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e—ero 0101 T,=10 1 001, 
it 020 0006 
O10" 1-0 00 0-i 
0001 On 0070 

Powe on 0 |, Tiel Oo 0 0 1 
0 0 7 Ono? 0 
0: 6 0 07100 
0708 iO 0c 


The 4 x4 matrices exhibited above are closely related to the 
Pauli spin matrices 

Or 1 | 

1 0 


My, = SeOo Tk M, 
0 1 
1 O 
0 -1 


M, 
which give a faithful representation of the same algebra 
using 2 x2 matrices. Further details, however, are outwith 
the scope of this book. 


I 
=< 
o«. O 
ell 
ee 
S 
I 


Exercise 51.1. Let L(F) be an n-dimensional linear 
associative algebra with an identity 1. Prove that any ele- 
ment a of L(F) satisfies a polynomial equation 

pa) = Agl+AyatA,a7 +... +4,-1a° *+a" = 9, 
where p is the polynomial Ag+A,x+... +A,-px7 142", 
such that if qg ¢ P(F) and satisfies g(a) = 0, then p is a 
factor of g. Show also that every element of L(F) is either 
a divisor of zero or has an inverse with respect to multi- 
plication. 
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§52. Lie algebras. In this section we consider briefly 
certain non-associative linear algebras known as Lie alge- 
bras. For these algebras it is convenient to denote the 
product of two vectors a,b by axb. In place of the asso- 
ciative law, we assume that, for all vectors a, b, c, we have 

ax(bxc)+bx(cxa)+cx(axb) = 0 (52a) 

which is known as Jacobi’s identity. We also assume that 

axa — 0: (52.2) 

for all vectors a. If we replace the vector a occurring in 
(52.2) by a+b, we get 


o = (a+b) x(at+b) = axataxb+bxatbxb 
o+axb+bxat+o 


and therefore 


axb+bxa=o (52.3) 


for all vectors a,b. This shows that in any algebra which 
satisfies (52.2) for all a, we also have (52.3) for all a, b. 
Conversely, if (52.3) is satisfied for all a, b then in particular 
it holds for b = a so that 


2axa = 0. 


This implies that (52.2) is satisfied unless F has character- 
istic 2. Thus (52.2) and (52.3) are equivalent conditions 
except in the case of an algebra over a field of character- 
istic 2. 

Lie algebras are of great importance since they arise in 
one way or another in different branches of mathematics. 
The set of geometrical vectors provides a familiar example. 
It has been shown already in §51 that their vector products 
satisfy (52.3). It may be verified that the definition of axb 
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there given also satisfies (52.1). Another example of a Lie 
algebra is provided by the Poisson brackets of classical 
dynamics. Lie algebras owe their name to the occurrence 
of such algebras in the theory of continuous groups, a sub- 
ject first developed by the Norwegian mathematician Sophus 
Lie. 

A Lie algebra can be constructed from any associative 
linear algebra by defining 


axb = ab—ba, 


in which ab denotes the product in the associative algebra. 
Clearly this definition satisfies (52.2). On the other hand 
the twelve associative products formed by the left side of 
(52.1) cancel in pairs, so that (52.1) is also satisfied. The 
resulting Lie algebra is trivial, however, if the original 
associative algebra is commutative, since in this case every 
Lie product a xb is the zero vector o. 
In the case of a Lie algebra (52.3) yields 


vty = 0 . (52.4) 
in place of (51.4). In the same way we get from (52.1) 
ob iret ViVi t Yay) = 0 (52.5) 


fOr all hs. 7, j,k: 

It is hopeless to look for a matrix representation a > M, 
of a Lie algebra in the sense that M,,., = M,M,, since we 
know that a Lie algebra is non-associative, while the algebra 
of matrices is associative. However let 


r; = E tee = 
Via Oe Yin 
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as before and consider the associative algebra generated by 
the matrices T,,...,T,, that is, the algebra whose elements 
are finite sums of the form 


> a, ns 


where A, is a product of the form I,,T;,...1;,. This 
algebra is not in general of dimension n since we have no 
reason to expect that the associative product II’; is necess- 
arily a linear combination of l,,...,1I°,. Consequently we 
cannot suppose that the matrices T,,..., I, form a basis 
of the associative algebra. 

However, from this associative algebra we can construct 
a Lie algebra by the method described above and this Lie 
algebra will contain certain products such as 

[xl jee TP -1 jf ,. 


We examine the matrix element in row / and column k of 
the matrix on the right. This is, using (52.4), (52.5) and 
again (52.4), 


2 Oi Ym Vn Yin) = y (yi ie Py Pea) 
= a Yh vy 
=) Th 
which is the matrix element in row h and column k of 
» vy I. 


Hence, for all i, 7 


rx, = »y vy p 
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corresponding to the relations 
e;X e; = » Vi; 1 


defining the original Lie algebra. This means that every 
Lie algebra is isomorphic with a subalgebra of a Lie algebra 
constructed from the associative algebra generated by the 
matrices T’,,...,I, and, possibly, the unit matrix /. 

We conclude by illustrating these remarks by applying 
the theory to geometrical vectors. The rules for multipli- 
cation given in the third example of §51 show that 


nea) Ol aes | C00, 1], F; = [0 =1-01. 
0-1 000 1 00 
01 0 -100 0 00 


These matrices do not form a basis for the associative 
algebra generated by them. For instance, I’? cannot be 
expressed in the form «,I,+a,F,+a313. However, the 
nine matrices I',I, do form a basis for the associative 
algebra and this associative algebra has an identity element 


I= —}(12+1241?). 


The Lie algebra constructed from the associative algebra 
contains products such as 


NRE it = ei, = T.F,+T,l, 


which are not linear combinations of [',, ',, 3 though they 
are necessarily linear combinations of the nine matrices 
IJ,. On the other hand all products of the form T, xT, 
are indeed linear combinations of I,,T,,1F3 and, in 
EAA O 
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particular, 


r, he = T,I,-T,T, = IT;, 
r,xQ; =TjT,-Ir3F, =17,, 


in analogy with the formulae 
€; Xn = C3, 2X03 = Cy, 03 XCy = Co 


for geometrical vectors. 


SOLUTIONS TO EXERCISES 


7.1 A is associative. 
7.2 A is associative in (ii) but not associative in (i) and (iii). 
7-30; 0), (0, 1), (1,0) and C, 1). 


9.1 A is not associative since (for example) 
QaAlal = (—5)Al = —54 and 
2A) = 2A0= 2; 
xAY =0 = yAx if and only if 
y = (1+,/(1+8x*)} / 4x?, x # 0. 


11.1 If + is commutative, then x-y = 0 > —y= —x, 
since each element has a unique inverse. But also —y has 
inverse y and —x has inverse x; hence, since —y = —x 


and each element has a unique inverse, we have y = x. 
If + is associative, then x—y = 0 => (x-y)+y = 0+y 
=y> xt(-yty=y>xt0=y>x=y. 


16.1 The left cosets of G relative to H, are 
fie ea, D) and) cll — 1c, 7,4). 
The left cosets of G relative to H, are 
H, = {e,c}, aH, = {a,d}, -bH, = {b,f}. 


23.2 Let E be an identity element for multiplication in the 
set of all matrices. Then 4E = A whenever AE is defined 
and EB = B whenever EB is defined. Let E be a pxq 
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matrix. If A is an nxp matrix, AE is defined and is an 
nxq matrix. Hence, since AE = A, we have p = q. 
Choosing A = [(,,, we get [(,,E = I). But [pE = E; 
hence E = Jp). 


25.3 The zero of R’ is (0, 0). We have (x, 0) (0, y) = (0, 0) 
for all x, y, so that R’ is not an integral domain. 


29.1 The non-zero elements form a group of order p—1. 
The order of any element of this group is a factor of p—1. 
Hence if a is non-zero, a?~! = 1. 

If n is not a multiple of p, then n?~! = 1 (mod p) by 
the above. If n is a multiple of p, then n’—n is clearly 
divisible by p. 


30.4 Consider the matrices of the form 


se Maso © 
0 :0°0.2.0 
020.0 ....0 
000 0 


32.2 By the definition of an ideal, all that we need to do is 
to prove that S is a subgroup with respect to addition. 
Given se S, we have xs eS for all x € R; hence, putting 
x = 0, we get OE S. Putting x = —1, we get (—l)s = 
—seéS for allseS. Hence S is a subgroup with respect 
to addition. 


32.4 The elements of Z/S, are the classes 0*, 1*,..., (n—1)*, 
where 0* = S,, 1* is the set of integers leaving remainder 
1 on division by n and so on. 
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32.5 If x e R*, then x is a finite sum of elements of the form 
Xx X2...X%, where x,ER (i =1,2,...,k). For any 
yeR, the product xy is a finite sum of elements of the 
POT eX, 0 py = Xx, X>.-. (4, ) and so xy e R*. Sim- 
ilarly yx e R*. It is easily verified that R* is a subgroup 
of R with respect to addition. 


32.6 The elements of R/R* are cosets of the form x + R*. We 
have (x, +R") (x,+R)... (x, +R") = x1 %2...%,+R* 
==" for any x,, %>,..-,%; € R and so (R/R*)* = 0. 


35.3 S3 is not a subgroup, because if pe S, and p, # 0, 
then —p ¢ S}. 


37.1 The elements of G are of the form m-+in, where 
m,neé Z. By definition 


g(m+in) = m?+n?. 
G is clearly a commutative ring with unity 1+70; since 
GcC, Gis an integral domain. It is easily verified that 
g{(m-+tin) (m' +in')} = (m? +n’) (m'? +n’?) 
> (m* +n?) = g(m+in) 


provided that m’?+n'? = 1;but m’,n’ are integers, so that 
m+n’? <1 => m' =0=n'. Thus condition (i) for a 
Euclidean ring is satisfied. 


Suppose that a, b é G and that ab~* = s+it, where s, ¢ 
are rational numbers. Choose integers m,n such that 
|s—m| < 4and |t—n | <4. Then m+ineG and 


a = b(m+in)+b{(s—m)+i(t—n)} 
= bg+r, 
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where g = m+in and r = b{(s—m)+i(t—n)}. Since 
r = a—bg, it follows that re G. Furthermore, 


g(r) = g[b{(s—m)+i(t—n)}] 
= g(b) g{(s—m)+i(t—n)} 
= g(b) {(s—m)?+(t—n)*} 


< $g(b) 
because of our choice of m, n. 
37.2 234. 
37.3 uy = —4+4x, uw, = 24427. 


37.4 u # 0; hence g(u) is defined. uv = e for some v; hence 
g(uv) = g(e) so that g(e) = g(u). But u = eu, so that 
g(u) = g(e). Thus g(u) = g(e). Since x = ex for all x, 
we have g(x) 2 g(e) = g(u). 


39.1 There are eight polynomials of degree 3 over Z,; they 
are 
a+bx+cx* +x}, 


where each of a, b, cis 0 or 1. If a = 0, then the poly- 
nomial is reducible, since it has a factor x. Thus there are 
four cases to consider. Ifb = 0 = c, orb = 1 = c, then 
the polynomial 


1+bx+cx? +x? 
has a factor 1+.x. The remaining two, namely 
l+x+x° and 14x?+x3 


are irreducible; to prove this we need only verify that 
x and 1+ x are not factors, since these are the only 
possible factors of degree one and any reducible poly- 
nomial of degree three must have a factor of degree 1. 
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Write p = 1+x+x°,g = 1+x*4+x°. The elements of 
P(Z,)/K, are expressible in the form e = [1], a = [x], 
b = [1+x], c= [x7], d= [1+x7], f= [x+x’], g= 
[l+x+x?] and 0 = [1+x+x°]. We can easily con- 
struct the addition and multiplication tables. It is also 
easy to see that b = e+a,c = a’, d = e+a’, f = ata’, 
g = e+a+a’ and that e+a+a°® = 0. 

Similarly, the non-zero elements of P(Z,)/K, are given 
by e’, a’, b’ = e' +a’, c= a?, d*=e' +a", f' = a’ +a’, 
g’ = e’+a'+a” where e’+a’*+a’> =0. The inverse 
of a’ is f’ and the elements of P(Z,)/K, can be written 
ef, gadtf, bi aft d met, =f +f2, 
ce =e'+f'+f, where e’+f’+f'? = 0. This suggests 
that the correspondence e > e’, a> f’, b> g', cD’, 
d-a, f-7d', gc’, 0-0 should be considered. 
From this we can in fact prove that P(Z,)/K, and 
P(Z,)/K, are isomorphic. 


41.1 x7, x+x7, 2xn+x?, 1+2x+x?, l+x+x?, 24+x7. Z; 
is not algebraically closed because (for example) 1+ x? 
is irreducible. 


48.1 If s;+s, = 5,+55, where s,, 5, € S, and sy, 5, € S), 
then $1 —-S, = 53-52 


Since s,—s{ eS, and s3—s,¢S, and S,S, consists 
of the zero vector, we have 


S,—-S{ = 0 = $3—Sp. 


48.5 Use Exercise 48.4. The dimension of the direct sum 
of n one-dimensional vector spaces is n; the result 
follows from Theorem 47.3. 
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51.1 Any n+1 elements of L(F) are linearly dependent. 
Hence there exist %, @,,...,%,, not all zero, such that 


Aol +ajat+... +4,a" = 0. 


Thus each aeL(F) satisfies a polynomial equation. 
Choose for p a polynomial of minimal degree such that 
P(a) = 0; suppose also that p is a monic polynomial. 
Let q be any polynomial such that g(a) = o. There exist 
polynomials s, t such that 


q = pstt, 


where either ¢ = O or the degree of ¢ is less than the 
degree of p. Since g(a) = o = p(a), we have t(a) = o. If 
t were non-zero, then t would be a polynomial of lower 
degree than p such that ¢(a) = o. This is a contradiction. 
Hence ¢ = 0, q = ps and pis a factor of q. 

If Ap # 0, then 


—(Ap *A,1+A9 1454+ ooe +ig-ta’)a — i 


so that a has an inverse with respect to multiplication. 
If Ag = 0, then 


(A,l+A,a+... +a )a = 0, 


so that a is a divisor of zero. 
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